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Description 

Hepatitis B vims suitace antigens and hyt)rid antigens containing them 



FiELD OF THE INVENmON 

5 The Invention reiates to hepatitis B vims antigens and hybrid antigens containing them and to the cloning of 
genes which code for said antigens in yeast' by use of recombinant WA techniques. 

BACKGROUND OF THE IIWEIslTlON 

10 A. Hepatitis B vaccines 

Infection with hepatitis B vims (HBV) is a serious, widespread health problem. Infection can be manifested In 
acute or chronic phases, in the United States, the number of cases of acute hepatitis is estimated to be at least 
100.000 per year with a fatality rate of 1 to 2 per cent and fhe prevalence of chronic earners of HBV among 
healthy adults varies between 0.1 and depending on age and sodai ctess^ In South America, the 

IS prevalence of chronic carriers is about 1 to 3Q/b; in the U.S^J). and southern Europe, about 3 to 6%b; and In 
Asia and Africa, more tiian lOOfe. 

In developed countries, there exists a need for a vaccine for people at high risic of exposure, such as 
patients and personnel in medical units where blood Is handled, military personnel, spouses of chronic i 
oarriere. travelers to areas of high HBV endemlclty, newborns of chronic carriers, homosexuals, prostitutes 

20 and dmg abusers. In third wvorid countries, there exists a need for an Inexpensiva vaccine for mass 

Immunization. Mass Immunization programs may ultimately affect not only the Incidence of acute hepatitis and ^ 
the pool of chronic carriere but may also reduce the mort>ldity and mortality from chronic active hepatitis and 
hepatoceOular carcinoma. 

Dane particles, wiilch are believed to be hepatitis B virions and which are isolatable from Infected patients, 
25 have a diameter of about 42nm. Each consists of an envelope comprising the hepatitis B surface antigen 
(HBsAg), a capsid (HBcAg), an endogenous polymers^ and a DNA gencmie. The genome is circular and 
double-stranded with a single strand region consisting of about 200 bases. The single strand region can be 
filled In, in vitro, by the action of the endogenous polymerase. The longer strand contains approximately 3,200 
t>ases. 

30 For a long time, It has been difficult to prepare HBV vaccines because it has proven difficult to propagate the 
virus in tissue culture and because the on^ Icnown host Is man. However, chimpanzees are also able to be 
Infected In the ial>oratory with the virus. 

Valenzuela et al. (Nature, 347-350, 1982) report synthesis of HBsAg in yeast using an expression vector 
wherein the HBsAg coding sequence is an 835 base pair (bp) Taql-Hpal fragment and the promoter Is the yeast 

35 alcohol dehydrogenase I pronnotor. Several eariler brief reports noted research preceding this reference. 
These are Valenzuela etal. (Arch. Blol. ii^ed. Exp. (Chile), 14(1), 21-22, 1981) which reports expression In yeast 
of a DNA fragment containing a sequence which codes fbr a protein similar to HBsAg Ogated to a yeast alcohol 
dehydrogenase promoter region; a report In Scrip No. 616, p. 14 (Aug. 12, 1981 ), which states tiiat a team of U. ««r f 

S. researchere Including P. Valenzuela and W. J. Rutter have announced production in yeast of the 

40 protein-coating surrounding hepatitis B vims'; and Zudcerman (Nature, 98-99, 1982) which reports that 

W. J. Rutter has reported expression of glycosylated HBsAg in yeast celte. . ^ 

Antigenic components of HBV, such as HBsAg. are reported to have t>een prepared In t>acteria fbliowlng 
Insertion of a recombinant Di^ molecule containing a gene which codes for the antigen. Bunrell et al. (Nature, 
279, Number 5708, 43-47. 1979) report expression in E. coll HB101 of HBV Dl^ sequences cloned in plasmid 

45 PBR322. 

Munray et ai.. European Patent Application Publication Number 13,828. 1980, disclose preparation of a 
recombinant vector which can code fbr HBV antigens. Including HBsAg. In E. ^ strain HB101. The vector Is 
prepared from Dane particle DNA and plasmid pBR322. The authore state that useful hosts m^ Include other 
bacterial hosts, yeasts and other fungi, animal or plant cells and other hosts, although the only host 
50 exemplified In ttie application Is E. coll . 

Chamay, et al. (Nature, 893-895, 1980) report constmction of a bacteriophage carrying a fusion of the 
P-galactosldase gene and the HBsAg structural gene. The bacteriophage directs synthesis of a fiision protein 
comprising antigenic determinants of tx>th HBsAg and ^alactosldase. 

Tloilals et al., UK Patent Application Number 2,034,323. disclose preparation of a collphage containing HBV 
55 DNA Fused phage-HBV DNA is transformed Into E. coll strain C600. 

In UK Patent Application Number 2,070,621, a plasmid which comprises a part of the HBsAg gene and the 
promoter and the Z gene of the lactose operon and which can be cloned in E. odi Is disclosed. 

Rutter et al., European Patent AppOcation Publication Number 20,251. 1980, disclose recombinant vectora 
including a recombinant vector comprising plasmid pBR322 and BamHI fragments of HBV DNA, which can be 
60 used to transform E. c^. Another vector, comprising a BamHI fragment of HBV DNA and a portion of the 
tryptophan operon, was used to obtain expression in E. coll strain HB101. 

Edman et al. (Nature. Number 5815. 503^, 1981) describe constmction of plasmlds which direct ^ ^ 

synthesis of HBcAg and a p-lactamase-HBsAg fusion protein, under the control of the tryptophan operon 
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regulatory region, in E. coll. 

Pumpen et al. (Gene, OT. 201-210, 1984) dlsdose that small amounts of HBaAg monomer proteins and 
fusion proteins are synthesized In E. ooH usihg antibodies raised agalrist the denatured HBaAg monomer for 
detection. 

Other references disclosing Insertion of HBV DNA Into bacterfa Include Chamay, et al. (Prog. Med. Virol., 27 
88-92. (1981); MacKay et al. (Proc. Natl. Acad. Sd. U.S. TB, Number 7. 45104514, 1981); FrItsch et al. (CS! 
Acad. Sd., 2|7, Number 16. 1453.(1978) ; U. K. Patent Specification 2,034,323 (Derwent No. 48874C) and Pasek 
et al. (Nature, ^ No. 6, 575, 1979). 

HBV DNA has also been doned In mammalian cells. These Include, human, mouse, and human hepatoma 
cell lines. For example, Dubois et al. [Ptoc. Nail. Acad. Sd. U.S... 77, Number 8. 454W653. 1980) report 
transformation of mouse ceDs with a plasmld containing the HBV genome and expression of HBsAg* 
Hirschman et al. (Proc. Natl. Acad. Sd. U.S.. 77. Number 9. 5S07-«11, 1980) report production of HBV-IIke 
partides by HeLa ceils transformed with HBV DNA. 

Procedures for preparing HBV vaccine using HBaAg from human Wood are reported by FunakoshI et al 
(Prog. Med. Virol., 27. 163-167. 1981) and Maupas et al. (Prog. Med. VIroL, 27, 185-201. 1981). The vaccine 
prepared by FunakoshI et al. contains 40 Jig of purtfldd, fomialln-trealed HBaAg, phosphate, sodium chtoride. 
20 mg of mannltol; and 0.1<Vb of aluminum hydroxide as adjuvant In the latter paper. Maupas et al. report that 
one dose of vacdne was 1 mL of purified. fomnaDn-treated HBsAg containing 2-10 n/mL of prxrtdn (Lowry*s 
method) and 0.1% of aluminum hydroxide. The protocd used In the study reported by Maupas et al. called for 
three injecttons at one month Intervals with a booster after one year; the authors propose a protocd 20 
consisting of two Injections of concerttrated HBsAg at three month Iritervais, 

Addlttortal references to proparatloh of HBV vaccines indude Maupas et al.. Adamov^cz et al., and 
FunakoshI et al. at pages 3. 37 and 57. re^aectlveiy, of Hepatitis B Vaccine INSEflM Sympodum No. 1 fl, edU by 
Maupas and Quesry. 1981, Elsevier/North-Hoiland Biomedical Press. 

Yeasts have been used as host organisms for expression of certain other non-HBV DNA sequences. For S6 
example. Fraser et al., U. K. Patent Application 2.068.969, dlsdose preparation of chtekeh ovalbumin In yeast; 
Scrip No. 640, p. 11 (Nov. 4, 1981) contains a report that a type of Interferori Is being prepared In yeast In 
European Patent 1 1 ,562 (Denvent No. 387620) are reported t^rld yeast plasmlds containing the ura* yeast 
gene in the 2 |i plasmld. — ^ 

The authentic Hepatitis B Virus surfece antigen (HBsAg) can be reooviired from plasma of Infected 30 
individuals as a partide of about 22 nm compifsed of two proteins known as Pa4, and Its glycosylated 
derivative QP28, both of whkxh are encoded by the 226 amino add coding sequence 6n the HBV genome 
known as the S protdn ooding sequence: The entire 163 amino add coding sequence which Immediately 
precedes the 8 protebi coding sequence on the HBV genome Is refened to herein as the Pre S coding 
sequence. The 55 amino adds of the Pre S coding sequence which immediately pfecede the S protein coding 
sequence are refened to herein as the Pre S2 coding sequence, and the remaining 1(» amino adds of the Pre 
S coding sequence are refened to herein as the Pre SI coding sequence. The Pre S coding sequence, or any 
smaOer portion thereof. Is also refen^ to herein as the HBsAg precursor protein. 

For completeness, It should be noted that the entire pre S coding sequence fbr sohie HBV subtypes (e.g., 
ayw) is 1 63 codons, whHe the entire Pre S coding sequence for other HBV subtypes (e.g., adW2) Is 174 codons. 
In either case, the Pre S2 region Is the 55 codons Immediately preceding the S prxrtdn ooding sequ^ce. 

The Pre S2 coding sequence codes for ttie polyaibumin binding or receptor dte found on the surftoe of 
Dane particles and on ttie surface of some HBsAg partides isolated from aera of HBV-lnfMed p^erits. 
Although the Pre S2 region immediately precedes the S protein coding region on the HBV genoitid, the F»re fe2 
region Is not invoived In tiie assembly of HBsAg partides. See, e.g.. Persing et al. (Proc, Nidi. Acad. Sd. U.8 A. 
3440-3444, 1985). 

Valenzuda et ai. (Nahire, 347^, 1982) disclose that particles resembRng HBsAg weie found after 
disruption of yeast cells transformed with a vector containing the S protein codlrtg sequence, and conduded 
that HBsAg partides are synthesized In yeast. 

Harford et al. (Developments in Bk>logtoa] Standardization. 54, 125-139. 1983) disclose that partides so 
resembling HBsAg were found after dismption of yeast ceils transfonned wfth a vdctor contalnihg 18 amino- 
adds of the coding sequence of omittilne cart>amoyl transferase Immediately preceding the N-termjrfal 42 
amino adds of ttw Pre S2 coding sequence Immediately preceding the S protein coding sequence. 

Lflub et al. (J. Virdogy, 48(1), 271-280. 1983) dlsdose construction of a Simian Virus 40 early replacement 
vector tiiat has a large portion of HBV genome DNA Indudlng ttie Pre S^ protein coding sequence, and 
expression of such portton by SV40- transfonned CV-1 cells (COS ceUs) transfbrmed by such vector. 

Stibbe et al. (J. Virology, 46(2), eSM2B, 1983) dlsck>se ttial minor HBsAg glycoprtrtdns, named QP33 and 
QP36, are coded for by the Pre S2-8 protein coding sequence. Stibbe et al. (Dev. Btol. Stand.. 54, 33-43, 1983) 
dlsdose that HBsAg particles containing large amounts of QP33 and QP36 did not Induce higher ftntiS protein 
antibody titere than those HBsAg partides almost devdd of QP33 and QP36. 

Heerman et al. (J. Virol., K (2). 39W02, 1984) dlsdose tiiat tiw entire 389 amino add codlrig sequence 
comprising both tiia S protein coding sequence and ttie Pre S coding sequence codes tor a polypeptide. P39. 
which is found In HBV particles and viral surttee antigen filaments along wtth its glycosylated fomi. QP42. and 
ttie ottier HBV surface antigen assodated polypeptides P24. GP27. QP33 and QP38. 

Neuretti et d. (Sdence. 392^, 1984) disclose ttrnt polypeptides having ttie sequence of ttie 26 
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amtno-terminal amino acMs of the Pre-G2 coding sequence act as an Immunogen, and suggest that antibodies 
elicited by the immunogen can be utilized for diagnostic tests. 

Michel et al. (Proc. NatJ. Acad. Sd. U.S A. 81 . 7708-771 2. 1984) disclose synthesis of HBsAg canrying human 
searm polyalbumin receptors in Chinese hamster ovary (CHO) cells transfected with a plasmid carrying the 
5 Pre S2-S protein coding sequence. 

Parsing et aL (Proc. 1^. Acad. Sci. U.S.A., ^ 3440-3444, 1985) disclose that mouse L cells transfonned 
with a Pre S2-S protein coding sequence produce three HBsAg related polypeptides (24.0(X), 27,000 and 
35,000 daltons) all of which may be organized Into complex Immunoreactive HBsAg particles of 22 nm 
diameter which bind to polymerized human serum albumin (HSA); while mouse L cells transformed with the 

10 Pre S2-^ protein coding sequence bearing a frame-shift mutation near the end of the Pre S2 region produce 
only the 24,000 and 27.000 dalton polypeptides organized into 22 nm dameter immunoreactive HBsAg 
particles which are unable to bind HSA. Perslng et al. conclude that the Pre S2rS protein coding sequence 
encodes the 35.000 dalton species, that the Pre S2 protein accounts for the HSA binding activity of HBsAg but 
Is not required for assembly and secretion of the HBsAg particles; and that the major polypeptide of HBsAg 

IS (I-e., the 24,000 dafton species) Is not derived primarily by cleavage of larger precursors (Le., the 27,000 and 
35,000 dalton species) encoded by the Pre S2-S protein coding sequence. 

MlQch et al. (Science, 1 195-1 199. (1985) disclose tiiat vacdnes tiiat contain HBV particles wRh botii Pre 
S2 and HBsAg, prepared from Chinese Hamster Ovary (CHO) ceils transfected with a plasmid containing the 
Pre 32-3 protein coding sequence, can circumvent nonresponsiveness In certain mice to vaccines which just 

20 contain only HBsAg; and that the 26 amino acid residues at the NH2-tennlnus of the 33,000 dalton polypeptide 
coded for by the Pre S2-8 protein coding sequence represent a dominant antibody binding site on the Pre S2 
retfon. 

Michel et al.. "Vaccines 88". Brown et aL, Ed^ Cold Spring Harbor Laboratory (1986), 356^. discuss ttie 
syntiresls In CHO cells of Hepatitis B surfoce antigen particles containing the Pre S2 region expression 
25 product 

Neuratti et al. (Nahjre, 315, 154-156. 1985) disclose that the Pre S region codes for proteins on the HBV 
envelope with domains specrficaliy recognized by Over cells; the Pre S2-8 protein coding sequence codes for a 
protein present in HBV particles; synthetic peptides conBsponding to the gene encoding Pre-S proteins are 
immunogenic; and conclude that HBV vacdnes should contain Pre S determinants. 

30 Valenzuela et ai. (Biotechnology, 3. 317-320, 1985} disclose the expression of the entire pre S2-S protein 
coding sequence in yeast transfonned with such coding sequence ; arKl also disclose that such yeast cells do 
synthesize a particle cont^nlng both HBsAg and Pre S2 which is very similar In electronmicroscopy and 
sedimentation properties to particles containing only HBsAg, and that the Pre-S2 region does not Infer the 
ability to form particles resembling the 22 nm HBsAg particles. Valenzuela et al. also disclose that It has been 

35 hypothesized that HBV gets into the liver by binding polyalbumln to Its polyalbumin receptor which In turn 
binds the polyalbumln receptor to the liver cell and thus, the virus gets intenraflzed, and that the polyalbumln 
receptor of the HBV is encoded by the Pre S2 region. Valenzuela concludes that a vaccine containing Pre S2 
will eOdt antibodes which. In addition to Inactivating HBV through the normal mechanism, might interfere with 
the w^ the virus enters liver cells. 

40 Takeda Chemical Ind. KK, European Patent Application Publication No. 171.908-A, claims the production of 
non-glycosylated hepatitis-B virus surface antigen P31 protein in yeast 

Chiron Corporation. European Patent Application Publication No. 0,174.444 A2. claims a method for 
preparing HBsAg particles containing P31 protein In yeast 
The following Table I compares known Pre S2 region amino add sequences witti the HBV pre S2 amino add 

45 sequence of the subject Invention : 
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Table I Comparison of amino acid sequence of arm 
region from different serotypes of HBV 
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B. Malaria Vaccines 

Numercus recent studies suggest that protective Immunity In a sensitive host to infection by the sporozoite 
stage of a particular species of Plasmodium can t>e mediated via antibodies produced by such host to the 
IS circumsporozoite protein (CS protein) of the particular sporozoite. 

The sporozoite proteins of some types of Plasmodium have been cloned and some of those fiave been 
expressed by recombinant DNA techniques. 

Nussenzwelg et al. (U.S. Patent 4,466.917) disclose a sporozoite polypeptide, identified as the P-44 protein, 
and its cloning and expression In E. ^11 
20 Sharma et aL (Science, 879-882, 1985) disclose the expression of a portion of the circumsporozoite 
(CS) protein of Rasmodlum knowiesi In yeast by employing an expression vector containing the S' regulatory 
region of the yeast alcohol dehydrogenase I gene in place of the upstream re^on of the P. knowiesi CS gene 
sequence. 

New York University, PCT application PubDcation Nuntber W084'02922-A. disclose cloning of a portion of 
25 the coding region for the P. knowiesi circumsporozoite (CS) protein repeat urUt and expression of 
beta4actamase ar)d t>eta-€a]actoslctose fusions thereof In E. 

Kemp et al.. PCT Patent Application Publication Number W08402917-A. disclose cloning and expression In 
E. ^{i of a coding sequence for the CS protein derived from the blood stage of P. fatelpanim. 
Dame et aL (Science, 225 . 593, 1984) report cloning and expression of the CS protein of P. falciparum In E. 
30 coil. The protein Is described as comprising aboxA 412 amlrK> ackls with an approximate molecular weight of 
44,000. It comprises 41 tandem repeats of a tetrapeptide. Synthetic 7-. 11- and 15- residue peptides derived 
from the repeat region kx>urKl to monoclonal antibodies raised against the CS protein. 

Blis et al (Nature. 9^. 536-538, 1983) report expression of a t)eta-lactamase - P. knowiesi CS protein fusion 
In E coll. 

S$ Enea et al. (Proc. Natl. Acad, Sd.. USA. 81. 7520-7524. 1984) report an analogous repeat unit structure 
wittiln the CS protein of P. cynomolgl. and the ck>nlng and expresskm of the CS protein in E. ooH as well as a 
disclosure of its coding sequence. 

Banou et al. (Science. 996-099. 1985) disclose that synttietic peptides of the repeating region of ttte CS 
protein of Plasmodium falciparum conjug^ed to bovine senim albumin (BSA) or to thyrogtobufin produced 
40 antibody responses in mice and rabt^ and conclude that a vacdne against the sporozoite s^age of the 
malaria parasite can be developed tyy using ^nthe tic peptides of the repeating re^n of the CS protein 
conjugated to a carrier protein. 

Weber et al. (Molecular and Bacterial Parasitology. 15, 305^6. 1985) disclose the use of a cloned CS 
protein gene of a Brazilian strain of P. falciparum as a probe to analyze the s^tructure of 17 other P. falciparum 
4S strains by nucleic acid hybridization, and conclude that the CS protein gene Is highly conserved and that 
malaria vaccine development vi^ the CS protein is unlikely to be complicated by strain variation. 

Amot et al. (Science. 815-818. 1985) disclose the ctoning of the CS protein of P. vivax and disclose Ite 
entire cocfing sequence and conclude that the coding sequence of the CS protein of P. vivax is homologous to 
that of the CS gene of P. knowiesi but not P. falciparum. 
SO Shanna et al. (Sdenoe, 229, 779-782, 1985) disclose the complete nucleotide sequence of the coding region 
of the CS gene of the Nuri strain of P. knowiesi 

Young et al. (Science, 958-962. 1985) disclose the expression of P. faldpanim CS proteins In E con for 
potential use in a human malaria vaccine. 
The Walter and Eliza Mali Institute of Medical Research, PCT Patent AppUcation PubDcation No. 
SS W084/Q2917. 1 984. disclose artificially constructed polynucleotide sequences substantially conrespondlng to 
all or a portion of P. falciparum mRNA or genomic DMA; the peptide corresponding to such sequence and a 
method for production thereof; a composition for stimulating immune responses against P. falciparum 
antigens In a mammal comprising such peptide. 
Mazier et al. (Science, 156-159, 1986) disctose that antibodies raised in mice Immunized witti several 
60 recombinant and synthetic peptides of the circumsporozoite protein of P. falciparum were evaluated for 
protective activity in a human hepatocyte culture system and were found to exhibit a protective effect against 
the parasite at tiirae pointe: sporozoite attachment to the hepatocyte surface, entry and subsequent 
intracellular devetopment. 
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C. Polyvalent vaccines 

Valenzuela et al. (Biotechnology, 3, 323^. 1985) disclose the use of the HBsAg polyalbumln receptor 
9 coded for by the Pre S2 coding sequence as a means to prepare potyvalent vaccines. Valenzuela et al. also 

disclose preparation of a hybrid HBsAg-Herpes simplex 1 virus glycoprotein D (FiSV-lgD) particle by 
expressing a hybrid HSV-lgD-Pre S2-S protein coding sequence In yeast 5 

Valenzuela, 'Hepatitis B subuntt vacclries using recombinant DMA Techniques,* Me^ 15, 1985 
Blo-Expo-5-Meeting, Boston Massachussetts, discloses that with a hybrid such as the HBsAg-HSV-lgD 
particle described above, there Is a problem of immunodomtnance of the HBsAg over the foreign Immunogen 
presented on its surface. 

Chiron Corporation, European Patent Application Publication No. 0,175,261 claims a novel hybrid particle io 
containing at least a major portion of the Hepatitis B surface antigen fused to one or more oligopeptides, 
where the hybrid polypeptide Is capable of forming a particle in a cellular host with presentation of at least one 
epitope of at least one oligopeptide. 

Proteins can undergo post-translational modifications and some of these can profoundly affect the protein's 
conformation and function. The presence of oligosaccharide chains on human derived preS1-preS2-6 and 16 
preSZ-S protein (Heerman et al. In J. Virol., 62, 398-402. 1984; Stibbe and Gerllch In Virology, 1^, 436- 442, 
1982; Stibbe and Qertlch In J. Virol., 46, 626-628, 1983) and presumably of myristlc acid on the preSV preS2-S 
protein (Perslng et at, Abstracts of papers presented at the 1986 meeting on Molecular Biology of hiepatitis B 
viruses. August 28-31, 1986, Cold Spring Hart>or l^aboratory) might Influence particle formation, conformation 
^ of the proteins in the particle and the antigenicity and Immunogeniclty of these particles. a? 

Yeast (Saccharomyces cerevislae) has the enzymatlccapabillty for protein giycoaytalion (BaAou, The 
Molecular Biology of the Yeast Saccharomyces, MetaboOsm and Gene Expression'; Strathem, Jones and 
Broach Ed., Cold Spring Harbor Laboratoiy (1982). p. 33&3e0) and fatty acid acytatlon fTowler and QIasler in 
« Proc. Natl. Acad. Sd., 83, 2812-2816. 1986) similar to those of higher eukaryotic celto. 

Viral surface glycoproteins expressed In yeast have been found to be glycosylated. Jabbar et al. (Proc. Natl. 25 
Acad. ScL, B2, 2019-2023, 1985) disclose that expression of the influenza viral hemagglutinin gene In yeast 
resulted In a glycosylated hemagglutinin protein. Wen and Schleslnger (Proc. Natl. Acad. Sd., ^, 3639^643, 
1986) disclose that expression of Slndbis and Vesicular stomatitis virus glycoprotelris In yeast gave 
glycosylated forms of these viral proteins. Fujisawa et al. (At)stracts of papere presented at the 1986 meeting 
on Molecular Biology of Hepatitis B viruses. August 28-31. 1986, Cold Spring Hsirbor LiJ^oretory, page 6S) 30 
disclose that expres^n of the preS2-S gene in yeast results In the synthesis of two glycosylated forms of the 
preS2-S protein. Kniskem et al. (Abstracts of papere presented at the 1986 meeting on Modem Approaches to 
New Vacdnes, Induding Prevention of AIDS. September 9-14. 1886. Cold Spring Harbor Labo retory. page 89) 
disclose that expression of the preS1-preS2-8 gene In yeast results In the synthesis of two preS1-preS2-S 
proteins of 45 kD and 39 l<D. Pereing et d. (Abstracts of papere presented at the 1986 meeting on Molecular ss 
BIdogy of Hepatitis B viruses. August 28-31. 1986. Cold Spring Harbor Laboratory, page 19> dlsdose that 
labeflng of C0S7 ceDs expressing preS1-preS2-S, preS2-8 and 8 proteins witti 'Hmyrfstic add results In the 
presence of labd In ttie preS1-preS2-S protein. 

Fusion proteins expressed In yeast arxi reassembled In a particle are purified by the general scheme : 
« Aerosn adsorption and desorption (1) CaCIa predpitatlon (2) phenylagarose or phenyiboronateaga- 40 

rose adsorption and desorption (3) -i^ CsCI gradient centrlfugation or DEAE-lon exchange chromatography. 

Step 1 Is described in EP application publication 0 204 680 and step 3 Is described In EP application 
publication 0 199 698 as far as phenylagarose is concenned. The specific combination of step (1 ) and (3) is able 
^ to perfonm a drastic clean-up of contaminating material (eDmlnation of protein, carbohydrates and 50(Vb 

lipids). This novel combination allows the good functioning of the next chromatogFaphlc steps. 4S 

In case tttat fusion protdns are glycosylated, phenylboronate (PBA) is used as an afflnify adsorbent At pH 
9.0 the boronate group wakaa covalent* complexes with potential cis^lioi groups on the glyqosylatad 
sidechains. PBA gels have been used occasionally for the purfficatton of soluble glycoprotalns but not for the 
purification of particles containing glycoproteins embedded In ttte lipid bllayer (See, Cook, et al., Analytical 
Btochemlstry. 149. 349-353, 1985; Middle, et al.. Bkxshemical Journal. 771-779, 1983; Cook, et al., 50 
Blochlmica et Blophysica Acta. 205-212. 1985; Maestasane. et al.. Journal of Chrt)matogrBphy. 189 , 
225-231 . 1980). Because of the multiple iigand binding sites on ttie partide. PBA gds witii very low PB-Hgand 
density have to be used, e.g., PBA gels with boronate content < lOjig/ml gel, preferabV 5>ig boronate/ml gel. 
The use of phenylboronate gels with very low boronate content allow affinity chromatography of partides 
containing embedded gtyooprotelns. 55 

SUMMARY OF THE INVENTION 

This Invention relates to a recombinant DNA molecule comprising a functional DNA coding sequence fused, 
In phase, to a portion of the Pre S2 region of a Hepatitis B virus (HBV) Pre S2-^ protein coding sequence. The 
tenm 'coding sequence' or 'coding region' as used herein also encompasses any functional derivative €0 
thereof. By the term 'functional derivative* Is meant a coding sequence with amino add alterations which, 
where appropriate, dont Interfere with partide formation and/or which retain Immunogeniclty where such 
retention Is desirable. Such functional derivative can t>e prepared by conventional site spedflc mutagenesis 
^ such as by the methods disclosed by Botsteln et al. ( Science, 229 . 1193-1201, 1985. Preferably such DNA 

molecule also comprises a regulatory region which Is preferably derived from the yeast arg^ gene. es 
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This Invention also relates to a recombinant vector which comprises a recombinant DNA molecule 
operatlvely linked to a regulatory region, wherein the DNA molecule contalris a functional DNA coding ^ 
sequence fused, in phase, to a portion of the Pre S2 region of a HBV pre S2-S protein coding sequence. By the 
term 'regulatory region' as used herein Is meant a DNA sequence which contains a promoter region and other 
5 sequences necessary for transcription and translation of a coding sequence. 

This Invention also relates to a eukaryotic host cell transformed with a recombinant vector, wherein said 
vector comprises a DNA molecule operatlvely linked to a regulatory reckon, and wher^n saM DNA molecule 
contains a functional DNA coding sequence fused, In phase, to a portion of the pre S2 regton of a HBV Pre 
S2-S protein coding sequence. This Invention also relates to a method of preparing such transformed tiost cell 
10 which comprises transforming a eukaryotic host cell with such recombinant vector. 

This invention also relates to a method for preparing a hybrid particle containing HBsAg protein which 
contains and/or presents a peptide encoded by afunctional DNA coding sequence, which comprises culturtng 
a eukaryotte host cefl transformed with a recombinant vector In appropriate culture, media and isolating the 
particle form a cell lysate or extract of such host's culture, wherein said vector comprises a DNA molecule 
IS operatlvely Qnked to a regulatory region, wherein the DNA molecule contains the functional DNA coding 
sequence fused; in phase, to a portton of the Pre S2 region of a HBV Pre S2-8 protein coding sequence. 

ITils Iriventton also relathm to a hybrid immunogenic partide TO 
(HBsAg) particle whk:h contains and/or presents a peptide encoded by a functional DNA coding sequence. 

This Inventton also relates to a vaccine comprising an bnmunoprotectlve amount of such hybrid partteles. 
2D This invention also relates to micelle comprising such hyl>rid parflde and polysoritMtto and to a vaccine * 
comprising an Immunoprotective amount of such micelles. 

By the term "functional DNA coding sequence' is meant a DNA coding sequence which, when fused, in 
phase, to a portion of the Pre S2 region of a HBV Pre S2-S protein coding sequence. Is not Involved with, and ^ 
does not Interfere with, the assembly of the HBsAg particle. 
25 Prefsnred functional DNA coding sequences include, but are not limited to, (a) the entire HBV Pre S protein 
coding sequence, or an Immunogente derivative thereof, such as. but not limited to, the Pre S2, Pre SI or Pre 
Sl-Pre S2 protein coding sec^ence; and (b) other immunogenic coding sequences such as the coding 
sequence of the drcumsporozolte protein of Plasmodium, or any Immunogenic derivative thereof, the 
envelope gene coding sequence of HIV or any Immunogenic derivative thereof, especially the C? peptide, 
30 peptide 121 or Drsesman peptide coding sequence or an Immunogenic derivative thereof. 

This invention also relates to a HBV Pre SI protein coding regkm which codes for the following amino add 
sequence: 

-163 

ATG GGG ACG AAT CTT TCT GTT CCC AAC CCT CTG 
Met Gly Th£ Asn Leu Ser Val Pro Asn Pro Leu 

40 * 
GGA TTC TTT CCC GAT CAT CAG TTG GAC CCT 

Gly Phe Phe Pro Asp His Gin Leu Asp Pro i 

46 

GCA TTC GGA GCC AAC TCA AAC AAT CCA GAT 
Ala Phe Gly Ala Asn Ser Asn Asn Pro Asp 

so 

TGG GAC TTC AAC CCC ATC AAG GAC CAC TGG 
Trp Asp Phe Asn Pro He Lys Asp His Trp 

55 



60 



66 
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CCA 6CA GCC AAC CA6 GTA GQA OTG G6A GCA 
Pro Ala Ala Asn Gin Val Gly Val Gly Ala 

TTC GGG CCA GGG CTC ACC CCT CCA CAC GGC 
Phe Gly Pro Gly Leu Thr Pro Pro His Gly 

GGT ATT TTG GGG TGG AGC CCT CAG GCT CAG 
Gly He Leu Gly Trp Set Pro Gin Ala Gin 

GGC ATA TTG ACC ACA GTG TCA ACA ATT CCT 
Gly He Leu Thr Thr Val Ser Thr He Pro 

CCT CCT GCC TCC ACC AAT CGG CAG TCA GGA 
Pro Pro Ala Ser Thr Asn Arg Gin Ser Gly 

AGG CAG CCT ACT CCC ATC TCT CCA CCT CTA 
Arg Gin Pro Thr Pro He Ser Pro Pro Leu 

AQA GAC AGT CAT CCT CAG GCC 
Arg Asp Ser His Pro Gin Ala 



10 
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so 



ss 



or to a HBV Pre S2 protein coding region or to a HBV Pre S2-S protein coding region wherein the Pre S2 
portion thereof codea for the foDowtng amino add aequenoe: 

-55 -50 ^ 

Met-Gln-Trp-Asn-Ser-Thr-Ala-Phe-Hls-Gln-Ala-Leu-Gln-Asp- 

« -40 ,30 

Pro-Arg-Val-Arg-Gly-Leu-Tyr-Phe-Pro-Ala-Gly-Gly-Ser-Ser- 4s 
Ser- 

-20 

Gly-Thr-Val-Asn-Pro-Ala-Pro-Asn-Ile-Ala-Ser-His-Ile-Ser- so 
-10 

Ser-Ser-Ser-Ala-Arg-Thr-Gly~Asp-Pco-Val-Thr-Asn; 



a recombinant DMA vector comprising such Pre SI. Pre 82 or Pre S2^ protein coding region operatlvely 
linked to a regulatory region; a transfomied host cell containing such vector; a process for preparing such 
transfonned cell which comprises transforming a host cell with such vector; the protein coded for by such 
coding region; a process tor preparing such protein; a vaccine comprising an Immunoprutectlve amount of 
such protein, the partfda code for by such Pre S2-S protein coding region; a process tor preparing such 
partide. a vaccine comprising an immunoprotecth/s amount of such particle, and a micelle comprlsir^g such 
particle and polysort>ate. and a vaccina comprising an Immunoprotectlve amount of such micelles. 

This Invention also relates to a recombinant DNA vector comprising an entire Pre 81-Pre S2-S protein 
coding sequence or PreSl^nctkmal DNA coding sequence4>roS2-S protein coding sequence, or 
Immunogenic derivative thereof, operetlvely linked to a regulatory region; to a yeast hoet ceii transfonned with 



ss 
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such vector; to a method of preparing such transformed host which comprises transfbmdng a yeast host cell 
with such vector; to a method of preparing protein encoded by the Pre Sl-Pre S2-^ protein coding sequence, 
or Pre^^^nctionaJ DMA coding sequence - Pre S2-S protein coding sequence, or an Immunogenic derivative 
thereof; to the protein prepared tsy such method; to a vaccine comprising an immunoprotective amount of 
5 such protein; to a particle containing polypeptide encoded by the Pre S1-Pre S2-S protein coding sequence or 
Pre SI -functional DMA coding sequence - Pre S2-S protein coding sequence or an immunogenic derivative 
thereof; to a method of preparation thereof, a vaccine comprising an immunoprotective amount of such 
particles, a micelle comprising such particle and polysorlMCte, and a vaccine con^xlsing an immunoprotective 
amount of such Micelles. 

10 This Invention also relates to a Pre Sl-Pre S2 protein coding sequence or Pre Sl-Pre S2-S protein coding 
sequence of which the Pre Sl-Pre S2, portion comprises the foDowing amino add sequence: 

rRE-Sl REGION 

-163 

ATG GGG ACG AAT CTT TCT GTT CCC AAC CCT CTG 
Met Gly Thr Asn Leu Set Val Pro Asa Pro Leu 



1$ 



20 



25 



so 



ss 



40 



45 



so 



60 



GGA TTC TTT CCC GAT CAT CAG TTG GAC CCT 
Gly Phe Phe Pro Asp His Gin Leu Asp P£o 

QCA TTC GGA GCC AAC TCA AAC AAT CCA GAT 
Ala Phe Gly Ala Asn Ser Asn Asn Pro Asp 

TGG GAC TTC AAC CCC ATC AAG GAC CAC TGG 
Trp Asp Phe Asn Pro lie Lys Asp His Trp 

CCA GCA GCC AAC CAG GTA GGA GTG GGA GCA 
Pro Ala Ala Asn Gin Val Gly Val Gly Ala 

TTC GGG CCA GGG CTC ACC CCT CCA CAC GGC 
Phe Gly Pro Gly Leu Thr Pro Pro His Gly 

GGT ATT TTG GGG TGG AGC CCT CAG GCT CAG 
Gly lie Leu Gly Trp Ser Pro Gin Ala Gin 

GGC ATA TTG ACC ACA GTG TCA ACA ATT CCT 
Gly He Leu Thr Thr vai Ser Thr He Pro 

CCT CCT GCC TCC ACC AAT CGG CAG TCA GGA 
Pro Pro Ala Ser Thr Asn Arg Gin Ser Gly 



10 



« 



A66 CAQ CCT ACT CCC 
A£g Gin Pro Ttir Pro 

AGA GAC ACT CAT CCT 
Arg Asp Ser His Pco 

PRE^S2 REGION 
-55 

ATG GAG T66 AAT TCC 
Met Gin Tip Asn Ser 

GCT CTG GAG GAT CCC 
Ala Leu Gin Asp Pro 

TAT TTT CCT GCT GGT 
Tyr Phe Pro Ala Gly 

ACA GTA AAC CCT GCT 
Thr val Asn Pro Ala 

CAC ATA TCG TCA AGO 
His lie Ser Ser Ser 
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ATC TCT CCA CCT CTA 
lie Ser Pro Pro Leu 

CAG GCC 
Gin Ala 



ACT GCC TTC CAC CAA 

Thr Ala Phe His Gin 

AGA GTC AG6 GGT CTG 
Arg Val Arg Gly Leu 

GGC TCC A6T TCA GGA 
Gly Ser Ser Ser Gly 

CCG AAT ATT GCC TCT 
Pro Asn lie Ala Ser 

TCC GCG AGG ACT GGG 
Ser Ala Arg Thr Gly 



GAC CCT GTG ACG AAC 
Asp Pro Val Thr Asn 



This invention also relates to a recombinant vector comprising such novel Pre S1-Pro S2 or Pre S1-Pre S2-S 
protein coding sequence operatlvely linked to an expression control sequence; a host cell transformed with 
such vector; a method of prepartng such transformed host which comprises transforming a host cell with such 
vector; to a method of preparing protein encoded by such Pre Sl-Pre 32 or Pre S1-Pre S2-S protein coding 
sequence, or an immunogenic derivative thereof; to the protein prepared by duch method; to a vaccine 
comprising an Immunoprotectlve amount of such protein; to a particle containing polypeptide encoded by 
such Pre Sl-Pre SS^ protein coding sequence or an immunogenic derivative thereof; to a method of 
preparation thereof, a vaccine comprising an Immunoprotectlve amount of such particles, and a micelle 
comprising such particle and polysorbate. 

The following represents the key to the amino add abbreviations used In this applk»tion: 
Asp Aspartic add 
Thr Threonine 
Ser Serine 
Glu Giutamk) add 
Pro Proline 
Gly Glycine 
Ala Alanine 
Cys Cysteine 
Val Valine 
Met Methtonlne 
lie Isoleucine 
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Leu Leucine 
Tyr Tyrosine 
Phe Phenylalanine 
His HIstldine 
Lys Lysine 
Arg Arglnine 
Trp Tryptophan 
Gin Qlutamine 
Asn Aspara^ne 

BRIEF DESCRIPTION OF THE FIGURES 

Rgure 1 is a restriction endonuclease cleavage map of pRm0601. 
Figure 2 is a restriction endonuclease cleavage map of pRm(^16. 

Figure 3 Is a restriction endonudease cleavage map of pMC200. ^.^^^ 
Rgure 4 is the nucleotide sequence of a portion off the 3300 bp] HIndlll yeast DNA Insert In pMC200 
which portion contains Hlncll, Bglll and EcoRI sites. 
RgureSisaflow^eetinustralingpreparatlonof pRrT10671 andpRITIOOTS. 
Rgure 6 is aflow sheet iilustrating preparation 35 of pRITI 0749. 

Figure7lsaflow8heetlliustra(tingpreparationofpRrri0761andpRrn0764. 

Figure 8 Is a flow sheet iflustrating preparation of pRmoW (Example 1). pRrri0158 and pRrT10162 
(Example 3); pRrn0158. pRin0677. and pRm0909 (Example 4); pRrri0911 (Example 5); P^^^^ 
pRIT1(»D3. PRIT10914 (Example 6); pRIT12211. pRrn2209 and pRm2230 (Example 7); and pRrT12288 
and pRrn2322 (Example 8). 

Rgure 9 Is a restriction endonuclease cleavage map of P^^^^}^ 

Rgure 10 Is a restriction endonuclease cleavage map of pRm2290. 

Rgure 11 Is a restriction endonuclease cleavage map of pRrri2322 wWch Hlustrates which portions 
thereof were derived from pRm2230 and pRin2288. ^^rr<n^A 

Rgure 12 Is afiow sheet niustrating preparation of pRITI^I ; pRIT12573; pRrri2572and pRITIZSM. 

Rgure 13 is a restriction endonuclease map of pRrT12309. 

Rgure Ulsafiow sheet illustrating preparation of pRrT12314andpRfn2544. 

Rgure 16 is afiow sheet Illustrating preparation of pRITI 2658. 

Rgure 16 Is a restriction endonuclease cleavage map of pRm2G62. 

F^re 17lsaflow8heet illustrating preparation of pRrT12660. 

Rgure 18lsaflowsheetHlustratingpreparationofpRm2845(a,b.c.d.e.f). 

Rgure 1A isafiow sheet iUustrattng the preparation of pRm2662. 

Rgure 2A shows a schematic restriction map and sequencing strategy of clone XMPfl. 

B^. a A «.K#^M» 4i%A ni ir4<M^Mrio eoni ionr*A nf th A circumsnorazolte Duotein oene from i 



Rgure 4A shows the Pr9S1-PreS2 DNA coding sequence comprised by pRIT12792. 
Rgure 1B Isa schematic restriction map of the HIV B virus envelope region. 
Rgure 2B illustrates the construction of plasmid pRIT12893. 
Rgure 3B Illustrates the constructon of plasmid pRIT12897. 
Rgure 4B illustrates the construction of plasmid pRrT12899. 
Figure SB IDustrates the construction of plasmid pRflX. 



DETAILED DESCRIPTION OF THE INVENTION 



EXPRESSION OF HBsAg IN YEAST 

By -regulatory region' as used herein is meant a DNA sequence which contains sequences necessary or 
desirable for transcrption and translation of a coding sequence. Such regulatory retfons are generally placed 
5' and adjacent to the coding sequence which Is to be expressed. Prefarably a further region of yeast DNA 
containing a signal for the tenmination of transcription Is placed Immediately 3' to the coding region to be 
expressed so as to effect termination of the transcript 

By "HBsAg" as used herein Is meant an antigen containing proteins encoded by tiie S protein coding 
sequence of tlie HBV genome, vifherein such HBsAg is structurally similar to auttientic HBsAg or has 
substantially the same S protein coding sequence encoded antigenic detennilnants as authentic HBsAg. that 
Is. is capable of stimulating an Immune response v*rhich sped ficaiiy recognize and react witti authentic HBsAg 
or is spedflcaily recognized by anti-HBsAg antibodies. 

The HBsAg cocfing sequence can be Isolated from DNA extracted from Dane particles In Infected human 
serum by filling in ttie single strand region wWi a DNA polymerase, preferably the endogenous poiymerase, 
followed by digestion wfth a restriction endonuclease. The choice of endonuclease will depend. In part, on the 
particular Dane particles. For example, the HBsAg coding sequence of HBV DNA of certain Dane particles of 
the adw serotype can be Isolated on a single BamH I fragment: the HBsAg coding sequence of HBV DNA of 
certeirT ^e particles of the serotype can be Isolated on a Hhai fragment. HBV DNA of Dane particles of 
the same serotype may also exhibit different patterns of restriction sites. 

Restriction of DNA to prepare DNA fragments used In the invention, ligation of such fragments to prepare 
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recombinant DMA molecules used In the Invention and Insertion into microorganisms are canried out by leiown 
techniques such as techniques dlsdosed In the previously and subsequently cited references. Conditions are 
^ selected to avoid denaturatlon of the DNA and enzymes. For example, the pH Is buffered to remain at about 7.0 

to 1 1 .0 and the temperature Is kept below about 60" C. Preferably restriction Is carried out at a temperature of 
about 30 to 40*^0 and ligation is carried out at about 0 to 10^ C. Restriction enzymes and ligases used In 5 
carrying out this invention are commerclslly available and should be used In accordance with Instructions 
Included therewith. T4 DNA llgase Is the preferred llgase. 

The various fragments and final constructions may be lolned together In accordance with conventional 
ways. In many cases, genes have been Isolated and res trictipn mapped, as well as sequenced. To that extent, 
one can select the sequence of interest, such as the HBsAg sequence, by restriction of the gene, employing 10 
furttier manipulation as necessary such as resection with Bald1 . In vitro mutagenesis, primer repair, or the like, 
to provide a fragment of a desired size, Including ttie desired sequence, and having the appropriate termini. 
Linkers and adapters can be used for Joining sequences, as well as replacing lost sequences, where a 
restriction site employed was Internal to the region of Interest. The various fragments which are Isolated, may 
be purified by electrophoresis, electroeiuted. Ilgated to other sequences, cloned, relsoiated and further 15 
manipulated. 

The use of regulatory regions of controlling transcription of the structural gene of Interest, such as the 
HBsAg sequence, may allow for growing the host cells to high density with no or low levels of expression of the 
structural gene, such as the HBsAg sequence, and then Inducing expression by changing the environmental 
conditions, e.g., nutiient, temperature, etc. 20 
* For example, with the QAL4 regulatory region, the yeast ceils can be grown in rteh medja with a 

glyceroHactic add combination to high density, e.g.. mkJ or tale log phase, folkawed by switching the carbon 
source to galactose. As another example, for PH05 regutation, one couki grow the cells at a high phosphate 
4 concentration of about 7 mM KH2P04. and then recover and resuspend them In medium lacking phosphate to 

effect derepresston and expression. For temperature sensitivity, one could grow the ceils at 25** to 37^0 and 25 
then change the temperature as appropriate by about 5*" to 20°C. The host cells woukl have the regulatory 
system associated with the regulatory region empbyed. 

Fusing of the HBsAg sequence to the regulatory rBgk>n can be accomplished by use of Intermediate 
vectors. Alternatively, the HBsAg sequence can be Inserted directiy Into a vector whtoh contains the regulatory 
region. A vector is DNA which can carry and maintain the DNA fragment the Invention. Including, for example, 30 
phages and plasmlds. Techniques for cloning DNA fragmente In phages are diseased, for example, by 
Chamay et al. Nature , Volume 286. 893-805 (1880) and Tlollals, United Kingdom Patent Application 2,034,323. 
Preferably, the HBsAg sequence Is positioned relative to the regulatory region such that the HBsAg 
synthesized by expression of the HBsAg sequence Is devoid of extraneous amino add residues. 

In one embodiment of this invention the HBsAg coding eequence togettier with 42 oodons of the HBsAg 35 
precursor (preS^) protein were fused in phase after the 18th amino add of the coding sequence of the yeast 
OCT protein which is transcribed from the arg^ promoter. This construct gives a hybrid partide containing 
HBsAg protein containing an additional 60 N-termlnal amino adds to the 226 amino adds of the S-protein 
proper. 

An exemplary regulatory region is derived from the yeast arg ^ gene which codes for omftiilne 40 
carbamoyl-transferase (OCT). The arg» regulatory region Is subject to both specific and genera] control 
mechanisms. It has been doned In E. con on plasmid pY eura^arg^ as reported by Crabeel et al., Proc. Nati. 
Acad. Sd. II.SA . Volume 78. 6026 (1981). Prefen^d hoste are S. cerevislae strains in whtoh the arglnlne 
t blosynthetic pathway is derepressed, such as strain icl697d. Use of such strains resulte In Increased 

expression from the args promoter as compared to strain DCS. Examples of ottier useful regulatory regions 45 
Include those derived from yeast genes of the glycolytic pathway such as those coding for enolase, aldolase, 
phosphoglycerate kinase and glycerakJehyde-S-phosphate dehydrogenase: the yeast alcohol dehydrogenase 
gene; and, In general, genes Involved In catatidism such as the argbiase gene. 

A preferred vector for doning the fused DNA fre^ent Into yeast Is the plasmkl YEpIS, whtoh Is capable of 
replication and maintenance In both E. Mil and S, cerevislae and Is, therefore, known as a shuttie vector. 60 
Several otiier yeast vectors are known and availab^. The HBsAg and regulatory regtons can be Inserted Into a 
yeast vector sequentially or simultaneously. Transfonmation with plasmid vectors containing autonomous 
replication sequences (repllcons) functional in yeast wiO normally result In extra-chromosomal maintenance of 
the DNA molecule of the Invention as a piasmkL Such repflcons are well known in the art. Transformation with 
plasmid vectors not containing repllcons functional In yeast can result In Incorporation of the DNA mdecule 55 
Into chromosomal DNA. 

Vacdnes for stimulating protection against HBV Infection In humans comprising HBsAg produced t>y yeast 
in accordance witii tiie invention and a suHabie carrier can be prepared by known techniques. Use of an 
adjuvant, such as aluminum hydroxide or aluminum phosphate. Is desirable. The HBaAg so produced can also 
be combined wtth other Immunogens to prepare combination vaccines. The HBV or combination vaccines can 60 
be administered, for example, by ttie subcutaneous. Intravenous or Intramuscular route. The DNA fragment of 
the invention and the HBsAg produced thereby, can also be used as a probe for detection of HBV in btologteal 
samples by DNA hydridlzation techniques and various Immunoassays. 
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EXPRES SION OF A DNA MOLECULE COWTAININQ A FUNCTONAL DNA CODING SEQUENCE FUSED. IN 
PH^eTtO a portion of the pre S2 region of a HBV pre S2-S PROTEIN CODING SEQUENCE 

Tills Invention also relates to a recombinant DNA molecule comprising a functional DNA coding sequence ♦ 
fused. In phase, to a portion of the PiB S2 region of a HBV Pre S2-S protein coding sequence. The term "coding 

5 sequence* or 'coding region* as used herein also encompasses any functional derivative thereof. By the term 
"functional derivative' is meant a coding sequence Involving amino add alterations which, where desired, 
dent Intefere with particle formation and/or which retain immunogenlcity. Such functional derivatives can be 
prepared by conventional site specific mutagenesis. Site specific mutagenesis techniques are taught by 
Botstein et al.. Science. 229. 1 193-1201 (1985). Such fusion may occur at the N-termlnud of the Pre S2 rerfon 

10 or at some point within the Pre S2 region. There can be four types of fusions, i.e.. at the 5'-tennlnus of the 
entire Pre S2 region, at some point v^rfthln the Pre S2 region so that Pre S2 DNA is flanking both sides of the 
functional DNA coding sequence, at the 5'-termlnal of a truncated portion of the Pre S2 region, or at tine 
5'-termlnus of the S-proteln coding region so that the fusion contains no pre ^ DhJA. Various Pre S2-S protein 
coding sequences are known and can be prepared or Isolated by known technk|ues. [See, references cited 

15 herein, supra] . In one embodiment of this Invention, the ARQ3 regulatory rogpon together with 18 amino acids 
of the yeast OCT protein were fused. In phase, to a truncated Pre S2-S protein coding sequence such that the 
fused sequence contained 42 codons of the Pre S2-S protein coding sequence together vm tiie complete S 
protein coding sequence. Prefenred functional DIMA coding sequences Include the drcumsporozoite protein 
coding sequence of Plasmodium , or ariy immunogenic derivative thereof, the envelope gene coding sequence 

ai of HIV or any Immunogente derivative thereof, especially the Gz peptide, peptide 121 or Dreesman peptide 
coding sequence or any Immunogenic derivative thereof, and the Pre S2, Pre SI or the entire Pre Sl-Pie S2-S 
protein coding sequence or any Immunogenic derivative thereof. For example* a prefenred functional DNA 
coding sequence is the first 13 Ntenninal codons of the Pre S2 coding sequence wWch are then Bgatad to the 
C-temiinal 42 codons of a truncated pre S2 region of a Pre S2-S protein coding sequence. By the term 

25 'Immunogenic derivative* Is meant a sequence which Isa derivative or modified verston of a naturally occurring 
sequence whteh retains or enhances the protective Immunogente activity of the natural sequence. Such 
Immunogenic derivatives can be prepared conventional techniques. The term 'Pre S2. Pre SI or the entire 
F>re Sl-Pre S2-S protein codfng sequence' as used herein Indudes any immunogenic derivative thereof. 
By the term * the drcumsporozoite protein of Plasmodium' Is meant the immunodominant surface antigen 

30 on the spoTDZotte of Plasmodium or any immunogente derivative thereof. Useful drcumsporozoite proteins 
(CS) Include, but are not limited to. those derived from Piasmodium faldparum, P. vivax. P. ovale, and P. 
maiariae . P. falciparum, P. vivax. P. ovale and P.maiariae infect humans. 

The CS'^rotein coding sequence of Piasmodium falciparum is disclosed by Dame et al., Science. 
593-699 (1984). The CS protein coding sequence ofP. vivax Is disck>sed by Amot et aL. Sdence, 815-818 

35 (1985). The CS protein coding sequence of P. knowiesi is disdosed by Shamna et al.. Science, 779-782 
(1985). Enea et aL, Proc. Nati. Acad Sd. USA, 81, 75207524 (1984). disctese the CS protein coding sequence 
of P.cynomdgl. The CS protein coding sequences of other spedes of Piasmodium are known (see references 
cited above) and/or can be derived by conventional techniques. 
As used herein, the temis 'CSP' or "CS protdn' wfll Indude both the natural, full length drcumsporozoite 

40 protein of Plasmodium as well as any immunogenic derivative thereof. By the term 'immunogenic derivative" of t 
the drcumsporozoite protein of Piasmodium is meant (^ any derivative of ttie drcumsporozolts protein of 
Plasmodium which retains protective bnmunogente activity or (b) any protein derived from a modified coding 
sequence of the drcumsporozoite protein of Piasmodium which retains protective immunogenic activity. Such ^ 
Immunogenic derivatives can be prepared by conventior^ techniques. Some immunogente synthetic 

45 derivatives of the CS protein coding sequence of P. fadpamm are disdosed in BaOou et al. Sdence, 
996-899 (1985). See also, IMazier et al^ Sdence , 231, 156-159 (1988). 

The CS coding sequence for various spedes of Piasmodium is known and can be prepared or Isdated by 
known technk^ues. [See, e.g.. Colman etal.. W084'<)2922-A (P. knowiesi CS protdn) and Dame et al.. Sdence, 
225 , 593 (1984) (P. faJdparum CS protein)]. A DNA molecule containing the CS coding sequence, or any ottier 

SO functional DNA coding sequence, fused In phase to a portion of the Pre S2 region of a HBV Pre S2-S protein 
coding sequence can be constructed by conventional techniques, such as by Dgating the CS coding 
sequence, or any other functional DNA coding sequence. In phase to any codon within the 55 amino add Pre 
S2 coding region of the Pre S2-S protein coding sequence. 
Preferably, enough of the Pre S2 coding sequence remains after ligation of the CS coding sequence, or any 

55 other functional DNA coding sequence, to insure optimal presentation of the CS protein, or any other 
functional DNA coding sequence protein, on the HBsAg particle surface. Le.. enough Pre S2 region should 
remain so that the Pre S2 regten encoded polypeptide acts as a bridge between the functional DNA protein 
coding sequence product and the HBsAg particle surface, it has been reported that the 26 amino-terminal 
amino adds of the Pre S2 coding sequence represent a dominant antibody binding site on the Pre S2 region. 
60 [See, e.g.. MiOch et al., Sdence . 228 . 1 195-1 1^ (1985)]. Thus, if it is desired to prepare a hybrid particle which 
contains and/or presents boVf\ Pre S2 epitopes and the CS, or any other functiorml coding sequence, epitopes. 
It is preferable to insert the CS coding sequence, or any other functional DNA coding sequence, at a point 
within the Pre S2 coding region v»mich enables preparation of a hybrid partide which contains and/or presents 
both Pre S2 epitopes and CS. or any other functional DNA coding sequence, epitopes. • 
6^ The recombinant vector of this invention comprises the recombinant DNA molecule of the subject invention 
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(l.e., a functional DNA coding sequence fused, in phase, to a portion of the l^e S2 region of a HBV Pre S2-8 
protein coding sequence) operatively linked to a regulatory region. Such vector nnay additionally also comprise 
a repllcon functional in yeast. By the term 'repilcon' is meant that minimum region of DNA which functions to 
maintain stably, and extrachromosomally, such recombinant vector In a host yeast organism transformed with 
such vector. Such repiicons are well known in the art. By the term 'regulatory regk>n' is meant any DNA region 5 
whtoh regulates transcriptton and translation of the coding sequence desired to be expressed. Such 
regulatory regions are well known in the art Such a vector can be prepared by conventkmal technkijues such 
as by inserting the DNA molecule of this Invention in phase Into a vector already containing a repiicon and 
regulatory regkm In such a way that the DNA molecule Is operatively linked to such regulatory region. 
Preferably the DNA molecule Is ligated to a vector capable of expression In the yeast genus Saccharohiyces . 10 

This invention also relates to a yeast host cell transformed with such recombinant vector and to a method of 
preparing such transformed host which comprises transforming a yeast host cell with Itte recombinant vector 
of this Invention. Such transformation Is carried out by conventk>nal techniques such as those already 
described herein. Preferred yeast cells include those belonging to the genus Saccharomyces, partfcxilarly 
those belonging to the species Saccharomyces cerevlslae and Saccharomyces carlsbergensls. is 

This inventton also relates to a method of preparing a hybrid particle containing HBsAg protein whtoh 
contains and/or presents the ftJnGtk)nal DNA coding sequence product wherein euch method comprises 
culturing the transformed yeast host cells of this frwentton In appropriate culture media, and isolating the 
particle from a cell lysate or extract of such host* s culture. By the term "presented* or 'contains and/or 
presents' Is meant that the fundionai DHA coding sequence protein Is contained within the hybrid parttele 20 
containing HBsAg protein In such a way as to enable an Immune response to the functional DNA coding 
sequence protein to be produced. When desirable, although not essen tial. the presentatton of the functional 
DNA coding sequence protein does not Interfere with the Immunogente activity of the hIBsAg epitopes, thus 
yielding a bivalent vaccine. Most preferat>ly the functional DNA coding sequence protein Is presented on the 
HBsAg particle in such a way that the nuodmum number of either the functtonal DNA coding sequertce 25 
protein's epitopes or both the functional DNA coding sequence protein and HBsAg epitopes are properly 
exposed. By 'appropriate culture media' is meant that media which will enable the transfomted host to survive 
and whteh will also enable such host to express the liybrid functional DNA coding sequence-Pre S2-8 protein 
coding sequence protein product of the vector it Is transformed with in rBooverable quantity, it will be 
appreciated by one of skill in the art that the appropriate culture me«Ba to use will depend upon the host cell 30 
employed. The Isolatfon of the hybrid particle of this inventfon from a cell iysate or extract of such host* s 
culture Is carried out by conventional protein isolation techniques. 

This invention also relates to a vaccine containing the hybrid particles of this invention. Such vaccine will 
contain an Immunoprotectlve quantity of the hybrid particles of this invention and Is prepared by conventional 
techniques. By 'immunoprotectlve" is meant that enough of the hybrid particles of this Invention are 35 
administered to elicit sufficient protective antibody response against the agent desired to be protected against 
without serious side effects. Preferably, the vaccine of this invention comprises the mlceOe of this Invention. 
i.e., a micelle comprising the particle of this Invention and polysorbate. The preferred amount of polysort)ate 
comprised by such micelle Is 5-60 polysorbate per 100 )ig protein. The micelle may be prepared by the 
method disclosed In European PMent AppOcation Publication No. 0 199 698. 40 

In the vaccine of the invention, an aqueous solution of the hybrid particles of the Invention, preferably 
buffered at physfological pH, can be used directiy. Aitemativeiy, the partfde, can be admbced or adsort)ed with 
any of the various known adjuvants. Such adjuvants include, among others, aluminum hydroxkJe. 

Vaccine preparation Is generally described In New Trends and Developments In Vaccines, edited by Vdler et 
ai.. University Paric Press. Baltimore. I^aiyland. U.SA. 1978. 45 

The amount of the hybrid particle of Ibe present Invention present in each vaccine dose Is selected as an 
amount which induces an Immunoprotecttve response without significant adverse side effects In typteal 
vaccines. Such amount will vary depending upon which spedfte immunogen Is employed arKf whether or not 
the vaccine is adjuvanted. Generally, it is expected that each dose will comprise 1-1000 ^g of protein, 
preferably 1-200 (ig. An optimal amount for a particular vaccine can be ascertained by standard studies 50 
involving obsen^tion of antibody titres and other responses In subjects. Folbwlng an Initial vaccination, 
subjects will preferably receive a boost in about 4 weeks, followed by repeated boosts every sbc months for as 
long as a risk of infection exists. 

The Immune response to the hybrid particle of this Invention may be enhanced by the use of adjuvant, such 
as, but not limited to, aluminum hydroxide (alum). 55 

To exemplify the hybrid particle of this invention, a 64 codon DUA sequence coding for the repetitive epitope 
of the drcumsporozoite protein (CS) of Plasmodium ftJdparum and an eight codon CS coding sequence [see 
Dame et al.. Science . 225 . 593^599 (1984)1 were each inserted In phase to a portion of the Pre S2 coding 
sequence in a yeast expression vector comprising a raplteon, the Pre S2-S protein coding sequence and an 
appropriate regulatory region. Both expresston vectors. I.e.. one containing the 64 codon Cs sequence and eo 
one containing the 8 codon CS sequence, were each introduced Into yeast host ceils, and extracts of such 
transfonned ceils were analyzed for the presence of HBsAg epitopes and CS epitopes on the same molecule. 

The results obtained from tmmunobfot analyses using a pool of five monocfonal antibodies specific for CS 
and a monoclonal antibody raised against purified yeast-derived HBsAg showed that a 87,000 kllodaiton (Kd) 
hybrid protein was obtained from lysates of yeast transformed witti the vector containing the 64 codon CS ss 

15 



0278940 

coding sequence fused In phase with the Pre S2-S protein coding sequence, and a 30.000 Kd hylnld protein 
was obtained from lysates of yeast transformed with the vector containing the 8 codon CS coding sequence 
fused in ptese wfth the Pre 32^ protein coding sequence. Assembly of these proteins Into HBsAg 
partlcle-aice structures was shown by CsCI and sucrose gradient analysis, high pressure liquid 
S chromatography (HPLC) and eleclron microscopy. Presentation of the CS epitopes on the exterior of such 
particles was shown by the accessibility of the Cs epitope to CS specific antibodies In an EUSA assay. 
Presentation of HBsAg epitopes on the exterior of such particles was shown by radlolmmune assay (AUSTRIA 
II. Abbott) or EUSA assay (En^ost-HBsAg. Behrlngwerke). 

10 NOVEL PRE S2 AND PRE S2-S PROTEIN CODING SEQUENCES 

The novel HBV Pre S2 and Pre S2-S protein coding sequences of this Invention are from an HBV adw 
subtype which was Isolated from plasmld pRrri0616. The tenm "coding sequence* or "coding re£^n" as used 
herein also encompasses any functional derivative thereof. By the term fiinctiona} derivative" Is meant a 
coding sequence with amino add alterations which, where appropriate, dont Interfere with partlde fbnnalion 

15 and/or which retain Immunogenlclty where such retention Is desirable. Such functional derivatives can be 
prepared by conventional site specific mutagenesis such as by the method of Botstein et al.. Science, 
1193-12D1 (1985). Such Pre S2 and Pre S2rS protein coding sequences can be recovered by conventional 
recomblnantDNAproceduresfrom plasmld pRrn0616whlch was deposited (within E.TO!lK12strrfnC600) In 
accordance with the Budapest Treaty at the American Type Culture Collection, RockviUe. Maryland, on June 2, 

20 1982, under accession number ATCC 38131. The Pre S2 region of such novel protein coding sequences codes 
for the following amino add sequence: 

-55 -50 
25 Met-Gln-Trp-Asn-Set-Tht-Ala-Phe-His-Gln-Ala-^eu-Gln-Asp- 

-40 -30 
Pro-Arg-Val-Arg-Gly-Leu-Tyr--Phe-Pto-Ala-Gly-Gly-Ser-Ser- 



30 



45 



-20 

Ser-Gly-Thr-Val-Asn-Pro-Ala-Pro-Asn-Ile-Ala-Ser-His-Ile- 



-10 

^ Ser-Ser-Sejc-Ser-Ala-Arg-Thr-Gly-Asp-Pto-Val-Thc-Asn. 

Such region may also be obtained by conventional synthetic ollgonudeotide synthesis. A preferred 
functional derivative of the Pre S2-S coding sequence of ttils Invention is one vj^hereln the alanine (GOT) amino 
40 acid at -45 Is modified via site specific mutagenesis to a threonine (ACT). Such functional derivative results In 
two times greater level of expression bv Saccharomyces cereveslae transformed therewith as com pared to 
the unmodified coding sequence of this Invention. 

A recombinant vector of this Invention comprises the Pre S2 or Pre-S2-S protein coding sequence of the 
subject Invention operatlvBly linked to a regulatory region. Such vector may also additionally contain a repllcon 
depending on preference and/or the host to be employed. By the term "repHcon" Is meant that minimum 
region of DNA which functions to maintain stably, and exlrachromosomaliy. such recombinant vector In a h<^ 
eukajyotic or prokaryotic organism transformed with such vector. Such repllcons are well-known In the art By 
the term "regulatory region" Is meant any DNA sequence which contains a promoter regton and other 
sequences necessary for transcription of a coding sequence. Such regulatory regions are well-known In the 
50 art Such a vector can be prepared by conventional techniques such as by Inserting the Pre S2 or Pre S2-S 
protein coding sequence of this Inventton In phase Into a vector already containing a repllcon and regulatory 
regton In such a vi«y that the DNA molecule Is operatively Dnked to such regulatory region. Preferably the DNA 
molecule Is llgated to a vector capable of expression In yeast such as Saccharomyces . 
This Invention also relates to a host cell transformed with such recombinant vector and to a method of 
55 preparing such transformed host Such transformation Is canied out by conventional techniques. Host cells 
can be prokaryotic or eukaryotic. Preferred host cells are eukaryotic especially yeast cells and Include ti)ose 



and Saccharomyces carisbergensis. 
This Invention also relates to a method of preparing a protein encoded by the Pre S2 or Pre S2-S pro tein 

60 coding sequence of this Invention wherein such method comprises culturing the transfbrmed host cells of this 
Invention in appropriate culture media, and Isolating the protein from the host cell culture fluid or from a cell 
lysate or extract of such host* s culture depending upon the host cell's ability to secrete such protein. By 
'appropriate culture media" Is meant that media which will enable the transformed host of this invention to 
sundve and which will also enable such host to express the Pre S2 or Pre S2-S protein coding sequence of this 

65 Invention In recoverable quantity. It win be appreciated by one of skill In the art that the appropriate culture 
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media to use will depend upon the host cell employed. The isolation of the Pre S2-^ protein of this Invention 
from a culture lysate or the culture fluid of such host is carried out by conventional protein isolation 
techniques. TTie proton prepared by the method of this Invention may be gtycosytated depending upon the 
host cell's ability to do so. If a deglycosylated protein is desired, the protein of this Invention should be 
prepared by utilizing a host cell Incapable of effecting such glyco^lation or by employing conventional site 5 
specific mutagenesis on the protein's coding sequence prior to canying out the proceaa of this Invention, 
such as by the method of Botstein et al.. Science , 229, 1193-1201 (1985). 

This invention also relates to a vaccine containing an immunoprotectlve amount of the Pre S2 or Pre S2-8 
protein of this Invention. Such vaccine may be prepared by conventional techniques. 

This Invention also relates to a method of preparing a hybrid particle containing HBsAg protein which 10 
comprises protein encoded by the PreS2nS protein coding sequence of this invention wherein such method 
comprises culturing the transformed eukaryotic host cells of this invention in appropriate culture media, and 
isolating the particle from the host ceO culture fluid or from a ceH lysate or extract of such hosfs culture 
depending upon the transtbnned host cells ability to secrete such particles. By 'appropriate culture meda' is 
meant that media which wDI enable the transformed host of this Invention to sunrive and which will also enable is 
such host to express the Pre S2-S protein coding sequence of this Invention In parttde form and In recoverable 
quantity. It will be appreciated by one of sidll in the art that the appropriate culture rnedia to use will depend 
upon the host ceil employed. The Isolation of the hybrid particle of this Invention from a culture lysate or the 
culture fluid of such host is carried out by conventional isolation technlquea 

This invention also relates to a vaccine containing the hybrid partides of this Invention. Such vaccine will so 
contain an Immunoprotective quantity of the hybrid particles of this invention and are prepared by conventional 
techniques. Preferably, the vaccine of this invention comprises the micelle of this invention. l.e., a micelle 
comprising the particle of this Invention and polysorbate. The prefen^d amount of polysorbate comprised by 
such micelle Is 5-60 pg polysoriaate per 100 pg fxroteln. The micelle may be prepared by the method disclosed 
In European Patent AppOcation Pid>llcation No. 0 199 898. 2S 

In the vaccine of the Invention, an aqueous solution of the Pre S2^ protein or hybrid particle of tiie 
invention, preferably buffered at physiological pH, can be used dIrecUy. Altennattvely, the protein or particle 
can be admixed or adsorbed wtth any of the various known adjuvants. Such adjuvants Include, among ottiere. 
aluminum hydroxide. 

Vaccine preparation is generally described In New Trends and Developments in Vaccines, edited by VoHer et 30 
al.. University Park Press. Baltimore. Maryland, U.S7V., 1978. Encapsulation wttiiln liposomes Is described, for 
example, by FuDerton. U.S. Patent 4.235,877. Conjugation of proteins to macromoiecules Is disclosed, for 
example, by Ukhite, U.S. Patent 4,372.946 and by Armor et al., U.S. Patent 4.474,767. 

The amount of the Pre S2-S protein or hybrid p«rticle of ttie present Invention contained In each vacdne 
dose Is selected as an amount which Induces an Immunoprotectlve response without significant, adverse side 35 
effects In typical vaccines. Such amount win vary depending upon which specific immunogen is employed and 
whetiier or not ttie vaccine Is adjuvanted. Generally, tt Is expected ttiat each dose win comprise 1-1000 (xg of 
protein, preferably 1-200 pg. An optimal amount for a particular vaccine can be ascertained by standard 
studies involving observation of antibody titres and other responses In subjects. Following an Initial 
vaccination, subjects will preferably receive a boost In about 4 wedks, followed by repeated boosts every sbc 40 
months for as long as a risk of Infection exists. 

The immune response to the Pre S2-S protein or hybrid particle of this Invention Is erihanced by the use of 
adjuvant such as. but not limited to, alum. 

NOVEL PRE S1 PROTEIN CODING SEQUENCE 4S 

The novel HBV Pre SI protein coding sequence of this invention is from an HBV adw subtype which was 
Isolated from piasmid pRm0616. The temi 'coding sequence' or 'coding region' as used herein also 
encompasses any fonctionai derivative thereof. By the terni functional derivative' Is meant a coding sequence 
involving amino acid alterations which , when desired, dont Interfere wtth particle f onmation and/or which retain 
Immunogenlcity. Such functional derivatives can be prepared by conventional site spedflc mutagenesis such ao 
as by ttie mettiod off Botstein etal.. Science. 229. 1193-1201 (1985). Such Pre SI protein codlr^ sequence can 
be recovered by conventional recombinant DNA procedures from plaamld pRrT10616 which was dspostted 
(witiiin E. coll K12 strain ceoo) In accordance wtth ttie Budapest Treaty at ttie American Type Cutture 
Collection. Roclo^ne. Maryland, on June 2. 1982 under accession number ATCC 38131. The Pre SI region 
codes for the foltowing amino acid sequence: SS 
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ATQ GGG ACQ AAT CTt TCT GTT CCC AAC CCT CT6 
Met GlY Th£ Asn Leu SeE Val Pco Asa Pro Leu 

5 

G6A TTC TTT CCC GAT CAT CAG TTG GAC CCT 
Gly Phe Phe Pro Asp His Gin Leu Asp Pro 

10 

6CA TTC GGA GCC AAC TCA AAC AAT CCA GAT 
Ala Phe Gly Ala Asn Ser Asn Asn Pro Asp 

15 

TGG GAC TTC AAC CCC ATC AAG GAC CAC TGG 
Trp Asp Phe Asn Pro He Lys Asp His Tcp 

20 

CCA GCA GCC AAC CAG GTA GGA GTG GGA GCA 
Pro Ala Ala Asn Gin Val Gly Val Gly Ala 

25 

TTC GGG CCA GGG CTC ACC CCT CCA CAC GGC 
Phe Gly Pro Gly Leu Thr Pro Pco His Gly 

30 

GGT ATT TTG GGG TGG AGC CCT CAG GCT CAG 
Gly lie Leu Gly Trp Ser Pro Gin Ala Gin 

35 

GGC ATA TTG ACC ACA GTG TCA ACA ATT CCT 
Ql7 He Leu Thr Thr Val Ser Thr He Pro 

40 

CCT CCT GCC TCC ACC AAT CGG CAG TCA GGA 
Pro Pro Ala Ser Thr Asn Arg Gin Ser Gly 

45 

AGG CAG CCT ACT CCC ATC TCT CCA CCT CTA 
Arg Gin Pro Thr Pro He Ser Pro Pro Leu 

50 

AGA GAC AGT CAT CCT CAG GCC 
Arg Asp Ser His Pro Gin Ala 



Such region may also be obtained by conventional synthetic oligonucleotide synthesis. 

A recombinant vector of this Invention comprises the Pre 81 protein coding sequence of the subject 
invention operatlveiy linked to a regulatory region and may additionally comprise a repDcon depending on 
preference and/or the host to be employed. By the term 'repiicoa* Is meant that minimum region of DMA 
^ which functions to maintain stably* and extrachromosomaHy. such recombinant vector In a host eukaryotic or 
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prokaiyotlc organism transfCNmed with such vector. Such repHcons are well-known in the art By the term 
"regulatory region' Is meant a DMA setquence which contains a promoter region and other sequences 
necessary for transcription of a coding sequence. Such regulatory regions are weH-known In the art Such a 
vector can be prepared by conveotlonal techniques such as by Inserting the Pre SI protein coding sequence 
of this invention In phase Into a vector already containing a replicon and regulatory region In such a way that s 
the DMA molecule Is operatfvety linked to such regulatory region. Preferably the DNA molecule is l^ed to a 
vector capable of expression in yeast such as Saccharomyces. 

This Invention also relates to a host cell transfbrnied with such recombinant vector and to a method of 
preparing such transfomned host. Such transformation is carried out by conventional techn^ues. Fiost cells 
can be prokaryotic or eukaryotlc. Prefenred host cells are eukaryotic especially yeast cells and include those 10 
bek>nging to the genus Saccharomyces , particularly those belonging to the species Saccharomyces 
cerevlsiae and Saccharomyces carlsbergiensls . 

This invention also relates to a method of preparing protein encoded by the Pre SI protein coding sequence 
of this Invention wherein such method comprises culturing the transformed host calls of this Inventton in 
appropriate culture media, and isolating the protein from the host cell culture fluid or ftom a cell lysate or is 
extract of such host's culture depending upon th9 transformed host celPs abfilty to secrete such protein. By 
'appropriate culture media" is meant that media which will enable the transformed host of this Invention to 
survive and which wiD also enable such host to express the Pre SI protehi cocfing sequence of this Invention In 
recoverable quantity. It will be appreciated by one of skDI in the art that the appropriate culture mecOa to use will 
depend upon the host cell empteiyed. The isdatfon of the Pre SI protein of this biventbn from a pulture lysate 20 
or the culture fluid of such host is carried out by conventional isolation techniques. The protein prepared by the 
method of this Inventton may be glycosylated depending upon the host cell's ability to do so. If a 
deglycosylated protein Is desired, the protein of this Invention shoukJ be prepared by utilizing a host cell 
incapable of effecting such glycosylatlon or by employing conventional site speclfto mutagenesis on the 
protein's coding sequence prior to carrying out the process of this invention. Site specific mutagenesis Is 2S. 
carried out by such as by the method of Botstein et ai.. Science , 229 , 1193-1201 ^1985). 

This Invention also relates to a vaccina containing the Pre SI protein of this invention. Such vaccine wIQ 
contain an Immunoprotective quantity of the Pre SI protein of this Invention and is prepared by conventional 
techniques. 

In the vaccine of the invention, an aqueous solution of the Pre 81 protein of the Invention, preferably 3a 
buffered at physiological pH, can be used cOrectiy. Alternatively, the Pre SI protein, can be admixed or 
adsorbed with any of the various known adjuvants. Such adjuvants Include, among ottiers, aluminum 
hydroxide. 

Vaccine preparatk>n is generally described In New Trends and Developments In Vaccines, edited by Voller et 
ai.. University Park Press, Baltimore, Maryland, U.SJV., 1978. Encapsulation within liposomes Is described, for 35 
example, by Fullerton, U.S. Patent 4,235,877. Conjugation of proteins to macromolecules Is disclosed, for 
example, by Ukhite, U.S. Patent 4,372,945 and by Armor et al., U.S. Patent 4.474,:^. 

The amoung of the Pre SI protein of the present invention present In each vaccine dose Is selected as an 
amount which Induces an immurx>protecttve response without slgnfficarrt. adverse side effects in typtoai 
vaccines. Such amount will vary depending upon which specific Immunogen is employed and whether or not 40 
tile vaccine is adjuvanted. Qeneralfy. it Is expected that e^ dose will comprise 1-1000 m ^ prcteln. 
preferably 1-200 }ig. An optimal amount fbr a particular vaccine can be ascertained by standard studies 
Involving observation of antbody titres and ottier responses In subjects. FdHowIng an Initial vaccination, 
subjects wOI preferably receive abpost in about 4 vraeks, fblkywed by repeated boosts every six months for as 
long as a risk of Infection exists. The Immune response to the PreSI protein of this ijiyention is enhanced by 45 
the use of adjuvant such as, but not limited to, alum. 

NOVEL PRESI -PRES2-S PROTEIN CODING SEQUENCE 

This Invention also relates to a HBV PreS1-PreS2-S protein coding eequence of whk^ the PreS1-PreS2 
portion comprises the following amino add sequence: so 
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PRE^Sl REGION 
-163 . 

AT6 GG6 ACQ AAT CTT TCT 6TT CCC AAC CCT CT6 

5 

Met Gly Thr Asn Leu Ser Val Pco Asn Pro Leu 
GGA TTC TTT CCC GAT CAT CAG TTG GAC CCT 

10 

Gly Phe Phe Pco Asp His Gin Leu Asp Pco 
GCA TTC GGA GCC AAC TCA AAC AAT CCA GAT 

IS 

Ala Phe Gly Ala Asn Sec Asn Asn Pco Asp 

^ TGG GAC TTC AAC CCC ATC AAG GAC CAC TGG 
Tcp Asp Phe Asn Pco lie Lys Asp His Tcp 

^ CCA GCA GCC AAC CAG GTA GGA' GTG GGA GCA 
Pco Ala Ala Asn Gin Val Gly Val Gly Ala 

^ TTC GGQ CCA GGG CTC ACC CCT CCA CAC GGC 
Phe Gly Pco Gly Leu The Pco Pco His Gly 

35 GGT ATT TTG GGG TGG AGC CCT CAG GCT CAG 
Gly lie Leu Gly Tcp Sec Pco Gin Ala Gin 

40 GGC ATA TTG ACC ACA GTG TCA ACA ATT CCT 
Gly He Leu The The Val Sec The He Pco 

45 



so 



S5 



60 



65 



20 



9 



0278M0 

CCr CCT GCC TCC ACC AAT CG6 CAO TCA GGA 
Pro Pro Ala Ser Thr Asn Arg Gin Ser 61y 

AGG CAG CCT ACT CCC ATC TCT CCA CCT CTA 
Arg Gin Pro Thr Pro lie Ser Pro Pro Leu 

AGA GAC AGT CAT CCT CAG GCC 
Arg Asp Ser Hie Pro Gin Ala 
PRE-S2 REGION 

-SS 

ATG CAG TGG AAT TCC ACT GCC TTC CAC CAA 
Met Gin Trp Asn Ser Thr Ala Phe His Gin 

GCT CTG CAG GAT CCC AGA GTC AGG GGT CTG 
Ala Leu Gin Asp Pro Arg Val Arg Gly Leu 

TAT TTT CCT GCT GGT GGC TCC AGT TCA GGA 

Tyr Phe Pro Ala Gly Gly Ser Ser Ser Gly 

ACA GTA AAC CCT GCT CCG AAT ATT GCC TCT 
Thr Val Asn Pro Ala Pro Asn He Ala Sec 
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IS 



20 



so 



35 



CAC ATA TCG TCA AGC TCC GCG* AGG ACT GGG 40 
His He Ser Ser Ser Ser Ala Arg Thr Gly 

GAC CCT 6TG ACG AAC 45 
Asp Pro Val Thr Asn 



The novel HBV Pre Sl-Pre S2-S protein coding sequence of this Invention Is from an HBV adw subtype 
which was Isolated from plasmid pRm0616 which, as stated above. Is available from the American Type 
Culture CoOectlon. Rockvllle. Maryland, under accession number ATCC 38131. The tonn "coding sequence" or 
•coding region" as used herein also encompasses any functional derivative thereof. By the tenn functional 
derivative" Is meant a coding sequence with amino add afterallons which, where appropriate, dont Interfere « 
with particle formation and/or which retain immunogenicity where such retentton Is desirable. Such functional 
derivatives can be prepared by conventional site specific mutagenesis such as the method of Botsteln et 
ai.. Science. 1193-1201 (1985). 

A recombinant vector of this Invention comprises the Pre-SI-Pre S2-S protein -coding sequence of the 
subject Invention operotively liniced to a regulatory region. Such vector may also addftfonally contain a repllcon 
depending upon preference and/or the host to be eniployed. By the term "repllcon" Is meant that minimum 
region of DMA which functions to maintain stably, the extrachromosomally, such recombinant vector In a host 
eukaiyotic or prokaryotlc organism transformed with such vector. Such repilcons are well-known In the art By 
the terni "regulatory region" Is meant any DMA sequence which contains a promoter region and other 
sequences necessaiy for transcription of a coding sequence which regulates transcription of a stnicturai 
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gene's coding sequence. Such regulatory regions are well-known In the art. Such a vector can be prepared by 
conventional techniques such as by Inserting the Pre S1- Pre S2-S-proteIn coding sequence of this Invention 
In phase Into a vector already containing a repllcon and regulatory region In such a way that the DMA molecule 
Is operatlvely linked to such regulatory region. Preferably the DNA molecule is ligated to a vector capable of 

5 expression In yeast such as Saccharomyces. 

This Invention also relates to a host cell transformed with such recombinant vector and to a method of 
preparing such transformed host Such transfonnatlon is carried out by conventional techn^ues. Host cells 
can be prokaryotto or eukaryotic. Preferred host cells are eukaryotic expectally yeast cells and Include those 
betonging to the genus Saccharomyces , particularly those belonging to the species Saccharomyces 

10 cerevislae and Saccharomyces cartst)ergensls. 

This Invention also relates to a method of preparing a protein encoded by the Pre Sl-Pre S2-S protein 
coding sequence of this inventton wherein such method comprises culturing the transformed host cells of this 
invention In appropriate culture media, and isolating the proton from the host ceil culture fluid or from a ceD 
iysate or extract of such host's culture depencfing upon the host cell's ability to secrete such protein. By 

IS •appropriate culture media* Is meant that media which wfll enable the transfomned host of this Invention to 
survive and which will also enable such host to express the Pre-SI Pre S2-S protein coding sequence product 
of this invention In recoverable quantity. It wiU be appreciated by one of skill In the art that the appropriate 
culture media to use vAW depend upon the host ceD employed. The isolation of the Pre Sl-Pre S2-S protein of 
this invention from a culture Iysate or the culture fluid of such host Is carried out by conventional protein 

20 isoiatton technkjues. The protein prepared by the method of this Invention may be glycosylated depending 
upon the host cell's abiDty to do so. If a deglycosylated protein Is desired, the protein of this Invention should 
be prepared by utilizing a host cell Incapable of effecting such glycosytaHon or by employing conventional site 
specific mutagenesis on the protein's coding sequence prior to carrying out the process of this Invention, 
such as by the method of Botstein et aL. Science, 229. 1193-12D1 (1985). The Pre Sl-Pre S2^ coding 

^ sequence of this Invention wiU be expressed as a glycosylated protein by a yeast or mammalian host cett 
transformed therewith. The Pre S1-Pre S2-S coding sequence of this lnventfon wifl be expressed as a 
N-myristylated protein by a yeast host cell transformed therewith. 

This Invention also relates to a vaccine containing an Immunoprotective amount of the Pre SI- Pre S2-S 
protein of this invention. Such vacdne may be prepared by conventional techniques. 

30 This Invention also relates to a method of preparing a hybrid particle containing IHBsAg protein whteh 
comprises protein encoded by the Pre S1-PreS2-S protein coding sequence of this Invention wherein such 
method comprises culturing the transfonned eukaryotic host cells of this Invention In appropriate culture 
media, and isolating the particle ftom the host cell culture fluid or from a cell Iysate or extract of such hosf s 
culture depending upon the transformed host cells ability to secrete such protein. By "^proprtate culture 

35 media" Is meant that mecfia which will enable the transfonned host of this Invention to survive and which will 
' also enable such host to express the pre Sl-Pre protein coding sequence of this invention In particle 
fomi and In recoverable quantity. It wlB be appreciated by one of sMIl in the art that the appropriate culture 
media to use will depend upon the host cell employed. The isolation of the particle of tWs Invention from a 
culture I^ate or the culture fluid of such host Is carried out by conventional isolation technk]ues. 

40 This Invention also relates to a vaccine containing the hybrid particles of this Invention. Such vaccine wW 
contain an Imm wwprotective quantity of tiie hybrid particles of this invention and are prepared by conventional 
techniques. Preferably, the vaccine of this Invention comprises the micelle of this Invention, l.e., a micelle 
comprising the particle of this Invention and polysoibate. The preferred amount of polysori3ate comprised by 
such micelle Is 5-60 jig polysort>ate per 100 |ig protein. The micelle may be prepared by the method disclosed 

45 in European Patent Application PubHcation No. 0 199 698. 

In the vaccine of the invention, an aqueous solution of the Pre Sl-Pre S2-S protein or the hybrid particle of 
the invention, preferably buffered at physfofogteal pH. can be used directiy. Aftematively, the protein or particle 
can be admixed or adsori^ed witti any of the various known adjuvants. Such adjuvants Include, among othere, 
aluminum hydroxide. 

50 Vaccine preparation is generally described in New Trends and Developments in Vaccines, edited by Voller et 
al.. University Pari< Press. Baltimore, Maryland. U.S.A., 1978. Encapsulation within flposomes is descrit>ed, for 
example, by Fullerton. U.S. Patent 4.235.877. Conjugation of proteins to macromdecules is disdosed, for 
example, by Ukhite, U.S. F»alent 4,372,945 and by Annor et al., U.S. Patent 4,474,757. 
The amount of the Pre Sl-Pre S2-S protein or hybrid particle of the present Invention present In each 

56 vaccine dose Is selected as an amount v^lch induces an Immunoprotective response without significant, 
adverse side effects In typical vaccines. Such amount will vary depending upon which specific Immunogen is 
employed and whether or not the vaccine Is adjuvanted. Generally, it is expected that each dose wilt comprise 
1-1000 |ig of protein, preferably 1-200 \iq. An optimal amount for a particular vaccine can be ascertained by 
standard studies Involving observation of antibody litres and other responses in subjects. Following an initial 

60 vacdnation, subjecte will preferably receive a boost In about 4 weeks, followed by repeated booste every six 
months for as bng as a risk of infection existe. 

The immune response to the Pre Sl-Pre S2-S protein or hybrid particle of this invention Is enhanced tiy the 
use of adjuvant such as, but not limited to, alum. 

65 



22 



9 



0278940 

EXPRESSION OF PRES1-PflES2-S PROTEIN IN YEAST 

This Invention also relates to a recombinant DNA vector comprising an entire PreS1-PreS2-^ protein coding 
sequence operatively ilniced to an expression control sequence. The term 'coding sequence" or "coding 
region" as used iierein also encompasses any functional derivative thereof. By the temn "functional derivative* 
is meant a coding sequence with amino acid alterations which, where appropriate, dont ^rfere with parUde 5 
fomnation and/or which retain immunogqnicHy where such retention Is desirable. Such functional derivatives 
can be prepared by conventional site specific mutagenesis such as by the method of Botstelh et al.. Science, 
229. 1193-1201 (1985). Such vector mc^ additionally comprise a repiicon depending Mpon preference and/or 
the host to be employed. Preferably, the PreS1-PreS2-^ protein coding sequence ^ployed is the one of this 
invention. io 

By the term "repiicon" Is meant that minimum region of Dl^ which functions to maintain stably, and 
extrachromosomalty, such recombinant vector In a host yeast organism transformed with the vector of this 
invention. Such repUcons are weH-iotown in the art By the term "regulatory region" is meant any DNA 
sequence which contains a promoter region and other sequences necessary for transcription of a coding 
sequence which regulates transcription of a structural gene's coding sequence. Such regiMory regions are IS 
welMcnown in the art. Such a vector can be prepared by conventional techniques such as by inserting a PreSI- 
PreS2-S protein coding sequence In phase Into a vector already containing a repiicon and regulatory region In 
such a way that the DNA molecule Is operatively linked to such regulatory region. Prsferably the DNA molecule 
is Dgated to a vector capable of expression in the yeast genus Saccharomyces . Preferably the Pre S1- Pre S2-S 
.protein coding sequence contained by the vector of this invention Is the Pre SI- Pre S2-S protoln coding 20 
sequence of this Invention descrit>ed hereinabove. Such vectore are also useful for the insertion therein of 
functional DNA coding sequences to enable expression of the resulting Pre 81-functionai DNA cocfing 
sequence-Pre S2-S protein coding sequence comprised by such vector. Therefore, this invention also relates 
to a recombinant DNA molecule comprising a functional DNA coding sequence fiised In phase into the Pre SI 
region of the Pre SI -Pre S2-S protein coding sequence, in such a way that the functional DNA sequence either JBpf 
forms an insertion in the Pre S1 region or replaces part of the Pre SI region and possit)^ a portion of the Pre 
S2 region, and to a vector comprising such racombbiant DNA molecule. See, for example. Example 21 A, infra. 
By the term "functional DNA coding sequence" Is meant a DNA coding sequence which, when fUsed. In phase. 
Into the Pre 31 region of the Pre Sl-Pre S2-8 proton coding sequence. Is not Involved with, and does not 
interfere with, the assembly of the HBsAg particle. Prefwred functional DNA coding sequences include but are 30 
not limited to, the drcumsporozolte coding sequence of Plasmodium or any immunogenic derivative thereof, 
the envelope gene coding sequence of HIV or any immunogenic derivative thereof, espedaiiy the 0/ peptide, 
peptide 121 or Dreesman peptide coding sequence as hereinafter exemplified, pr sequences coding for 
peptides of interest firom other viruses e)q>eclany those fbr coat proteins or for surface proteins firom other 
parasites. 35 

This Invention also relates to a yeast host ceD transformed with such recombinant vector and to a method of 
preparing such transformed host Such transformation is carried out by conventionpl techniques. Preferred 
yeast ceils and Include those belonging, to the genus Saccharomyoes. partlculariy those belonging to the 
species Saccharomyces cerevisiae. 

This invention also relates to a method of preparing a protein encoded by a Pre Sl-Pre S2-S protein coding 40 
sequence or Pre Sl-functlonal DNA coding sequence-Pre SZS protein coding sequence wherein such 
method comprises culturing the transformed host cells of this Invention In appropriate culture media, end 
Isolating the protein firom the host cell culhire fluid or from a cell lysate or extract of such hosf s culture 
depending upon the host cell's ability to secrete such protein. By "appropriate culture media* Is meant that 
media which wDl enable the transfonned host of this invention to survive and which will also enable such host 45 
to express a Pre Sl-Pre S2-S protein coding sequence or Pre Sl-functlonal DNA coding sequence - Pre S2-S 
protein coding sequence product In recoverable quantity. It wiO be appreciated by one of sidli in the art that the 
appropriate culture media to use will depend upon the host cell employed. The isolation of the Pre S1-Pre 62-^ 
protein or Pre SI -functional DNA coding sequence - f*re S2-S protein from a cell lysate or the culture fluid of 
such host culture Is carried out by conventional protein Isolation techniques. The protein prepared by the SO 
method of tNs invention may be glycosylated depending upon the host cell's abmty to do so. If a 
deglycosylated protein is desired, the protein of this Invention should be prepared 1^ utilizing a host ceil 
Incapable of effecting such glycosylation or by employing conventional site specffic mutagenesis on the 
protein's coding sequence prior to carrying out the process of this Invention, such as by the method of 
Botetein et al., Science . 229, 1193-1201 (1985). S6 

This invention also relates to a vaccine containing an immunoprotective amount of Pre-SI-Pre S2-S protein 
or Pre SI- functional DNA coding sequence - Pre S2-S protein produced by the method of this invention. Such 
vaccine may be prepared by conventional techniques. 

This Invention also relates to a method of preparing a hybrid particle which comprises protein encoded by a 
PreS1-PreS2-S protein coding sequence or Pre Sl-functlonal DNA coding sequence-Pre S2-S protein eo 
coding sequence, wherein such method comprises culturing the transfonned yeast host cells of this Invention 
In appropriate culture media, and Isolating the particle from the host cell culture fluid or from a cell lysate or 
extract of such hosf s culture depending upon the transformed host ceils ability to secrete such particles. 1^ 
"appropriate culture media" is meant that media which will enable the transfonned host to aurvlve and which 
will also enable such host to express the Pre Sl-Pre S2-S protein coding sequence or Pre 81-functtonal DNA 6S 
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coding sequence - Pre S2-S protein coding sequence In particle form and In recoverable quantity. It win be be 
appreciated by one of sidll In the art that the appropriate culture media to use will depend upon the host cell ^ 
employed. The isolation of the particles of this Invention from a culture lysate or tfie culture fluid of such host Is 
canted out by conventional isolation techniques. 

5 This Invention also relates to a vaccine containing tiie particles of tills invention. Such vaccine will contain an 
immunoprotectiVB quantity of the particles of this invention and are prepared by conventional techniques. 
Preferably, the vaccine of this Invention comprises the micelle of this Invention, Le., a micelle oompri^ng the 
particle of this Invention and polysorbate. The prefenred amount of polysorbate comprised by such miceile is 
5-50 Jig polysorbate per 100 jig protein. The micelle may be prepared by tiie method disclosed In European 

10 Patent Appncation Pubfication No. 0 199 698. 

In tiie vaccine of tiie Invention, an aqueous solution of tiie Pre Sl-Pre S2-S protein or Pre SI functional DMA 
coding sequence - Pre S^ protein or ttie particle of the Invention, preferably buffered at physiological pH. 
can be used directiy. Alternatively, the protein particle can be admixed or adsorbed wltti any of the various 
laiown adjuvants. Such ad^ivants include, among others, aluminum hydroxide. 

IS Vaccine preparation is generally described in New Trends and Developments In Vaccines, edited by Voller et 
al.. University Parle Press, Baltfrnore, Maryland, U.SJV.. 1978. Encapsulation wWiin Hposomes is described, for 
example, by Fullerton. U.S. Patent 4,235,877. Conjugation of proteins to macromoiecuies is disclosed, for 
example, by Ultfilte, U.S. Patent 4,372,945 and by Armor et al.. U.S. Patmit 4,474,757. 
The amount of tiie Pre SI -Pre S2-S protein or Pre Sl-fUnctional DNA coding sequence - Pre S2-S protein or 

20 particle of ttie present Invention present In each vaccine dose Is selected as an amount which induces an * 
Immunoprotective response wittiout significant, adverse side effects in typical vaccines. Such amount win vary 
depending upon which specific Immunogen Is employed and whetiier or not the vaccine is adjuvanted. 
Generally, It Is expected that each dose will comprise 1-1000 |ig of protein, preferably 1-200 jig. An optimal ^ 
amount for a particular vaccine can be ascertained by standard studies Involving observation of antibody tilres 

25 and otiier responses In subjects. Following an Initial vaccination, subjects will preferably receive a boost In 
about 4 weeks, followed by repeated k>oost8 every six months for as long as a risk of Infection exists. 

The immune response to the Pre SI -Pre S^S protein or Pre SI -functional DNA coding sequence - Pre S2-S 
protein or particle of this frivention Is enhanced by the use of adjuvant such as, but not limited to, alum. 

SO EXAMPLES 

in the folk)wing examples of the invention, which are Illustrative and not limiting 

- All percentages are by weight and all temperatures are In degrees Celsius (Centigrade). 

- The enzymes used In DNA manipulations were obtained from Bethesda Research Laboratories, New England 
Biolabs, and/or Boehringer, and were empkyyed according to the suppliers' directions. 

^ - Yeast growtti media : selective medium : YNB (yeast nitrogen base wittiout amino adds (DIfco Labs) 0.675 <Vb 
(wA^) containing 2% (w/v) glucose). 

- YEPD medium : 1 <Vb (w/v) yeast extract, 2% wN peptone and 2<Vb (w/v) glucose. 

- DNA recombinant methods are described in 'Molecular Ckming", T. Manlatis et al.. Cokl Spring Hartwr Lab. 

(1982). ^ 
40 - PMSf : phenyl methyl sulphonyl fiuorkle 

- The fbllowing abbreviations m^ be emptoyed: 

PBS : Phosphate Buffered Saline (per liter): (pH 7.4) * 
8.0 gram Nad 
45 02, gram KCI 

1.15 gram Na2HP04 
0.2 gram KH2PO4 
0.1 gram CaCb 
0.1 gram MgCb • GHaO 

SO 

BSA: Bovine Serum Albumin (A437a. Sigma) 

X-gal : S-bromo^hforoindoyl-P-D-galactopyranoslde, commerdatly available from Sigma Chemical Co., St 
LjouIs, Missouri. 

SS L-BROTH (per liter): 
10 gram Tryptone 
5 gram NaCI 
5 gram Yeast extract 
1 ml 0.1M MgS04 

60 Add 5 ml of aseptic solution of thiamine (1 mg/ml) after autodaving. For soOd media, add 16 g agar per liter. 

Example 1. Construction of plasmld pRfTIOW. 

The starting material was plasmld pGy containing a 2.1 kb Hlndl ll fragment of yeast DNA encoding the TDH3 
gene cloned on pBRd22. pBR322 is described by Bolivar et al. (Gene. 2, 95-113. 1977). The p6y plasmld was 1 
65 constructed and characterized 1^ Musti et al. (Gene. 25, 133, 1983) and received from Dr. M. Rosenberg of the 
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National Institute of Health. The Hind lll fragment was recloned onto a pBR322 dertvatlve In which the EcoRI site 
had been destroyed to give plasmtd pRm0164 (a non essential maneuver). The foOowbig manipulations were 
carried out so as to create a BamHl site located at the Q residue of ftte ATQ codon of the TDI^ coding 
sequence and to obtain a HIndlll-BamHI DNA fragment with TDhl3 promoter activity In which the TDH3 
sequences are Intact and unclianged by the manipulation. DNA of pRrT10i64. preparsd as described above, s 
was purified by two cycles of CsCI - ethldlum bromide density gradient centrtfugation essentiaiJy as described 
by Kahn et al. (MettKxls In Enzymology, 268, 1979). 150 ^g of pRIT10164 DNA were digested to completion 
wfth 75 units of Xbal endonudease, extracted wttti phenol and ether, precipitated with ethanol and the DNA 
resuspended In 0.01 M tromethamlne-HCI buffer at a concentration of 1 (ig per ^I. Samples of 20 ^ of this Xbal 
treated DNA were digested with Ba^l nuclease to resect the 61 base pairs of DNA between the ATQ codon 10 
and the )^ site. DIgestons with Ba[31 were carried out at 30° for 1 to 3 minutes In buffer containing 600 mM 
Naa, 12 mM CaCI2, 12 mM MgCI2, 1 mM EDTA, 20 mM tromethamlne-HCI, pH3.1 using one unit of Bai3T 
nuclease per 20 iig DNA In a final reaction volume of 200 jii. 

The enzyme reactions were stopped by addition of ethylenebis (ro^ethylenemtrilo) tstrBacettc acid (EGTA) 
to give 20 mM final concentration and the samples were extracted with equal volumes of phenol, ether and is 
precipitated witii ethanol. Each DNA sample was resuspended In 20 }il 10 n]M tromethamlne-HCI buffer pH 7.5. 
The extent of BatSI digestion was measured by digesting about 2.5 ^ of each Dl^ sample with Hpal 
endonudease and comparing the size of the Hpal - Xbal fragments to the 335 bp Hpai-Xbal fragment from 
PRIT10164. A 2 minute digestion wltti BalSI was found to remove about 41 to 88 nucleotides from the 
Xt>ai-Hpal fragntent of pRIT10164. 20 

This result Indicates that a similar number of base pairs had been removed from the other Xbal extremity 
towards the ATQ codon. 5 ^g of the DUA recovered from the 2 minute Bal31 digest was digested with BamHl 
endonudease, extracted with phenol and recovered by ethand predpitation. This DNA was treated with 5 
units of T4 polymerase In the presence of deoxynudeotide triphosphates to fin In and render flush ended the 
BamH l and any Bat31 single stranded extremities, extracted witii phenol and recovered by ettianoi 25 
predpitation. 2.3 |ig of ttito DNA was treated wltti 5 units of T4 DNA Ogase and half of ttie Dgation mixture used 
to transform competent cells of E.od[ K12 strain MM294 prepared according to the method of Cohen et al. 
(Proa Nati. Acad. Sd. ^, 2110, 1972). One mi of the transformed E coll population was dOuted Into 350 ml 
L-broth with 200 |ig/mi ampidnin and total plasmid DNA was purified from the resulting culture. 

80 ^g of this plasmid DNA containing a population of plasmid molecules with Bal31 Induced deletions of $0 
about 40 to 80 base pairs was digested sequentially wigh 75 units of Hind lll endonudease and 96 units of 
Bam Hl endonudease to release DNA finagments from those plasmids In which a Bam Hl site had been 
recreated after the treatments described above, that Is. where the Bal31 digestion had terminated at a Q 
residue. This DNA digest was run on a preparative lOb agarose gel and the desired HIndl ll -Bam HI fragments 
corresponding to sizes frorn 1 1 00 to 1000 bp excised from the gd In two slices, one conrespondlng to DNA of 35 
about 1070 bp and the other of about 103Q bp. The DNA was recovered from the two agarose gel slices by 
cycles of freezing and thawing followed by high speed centrtfugation to peRet the agarose. The supemaiant 
liquid was filtered through a Mniipora QV MOlex filter and ttie DNA recovered by two cycles of ettiand 
predf^tion and resu^iended In 20 |tl of 0.01 M tromettiamlne-HCI buffer pH 7.5. 

Analysis tyy agarose gel electrophoresis and comparison to a Hindlir plus )^ digest of pRm0164 DNA and ^ 
to the fragments fh>m a Hindl ll plus EcoR I digest of phage X DNA showed that two distinct populations of 
HIndi ll -BamH I fragments were obtained, one witti an estimated size of about 107O bp and one witii an 
estimated size of about 1030 bp compared to ttie 1120 bp parental HIndl il-Xbal fragment fix)m pRm0164. 
About 1 00 ng of the 1030 bp HIndlll-BamHI fragment population was Dgated witti 200 ng of plasmid pUC9 which 
had been digested with Hind lll and BamHl endonucleases and treated with alkaline phosphatase (See, Slilne, 46 
U.S. Patent No. 4.264.731) and ttie ligation mixture was used to transfbmi competent cdls of E. cojl straiii 
JM103wltti selection being made for ampldllln resistance. Transformation of E. coll strain JM103 was effected 
according to the method of Cohen et al.. cited above. 

Both the pUC9 vector plasmid and the recipient E. coll strain JM103 have been described by VIeIra and 
Messing In Qene, 19, 259. 1982 and were obtained from J. Messing (Univ. of Minnesota). The pUC9 vector Is 50 
commercially available from Amersham (Buddnghamshire. United Kingdom) and Pharmacia (Uppsala. 
Sweden). E coll strain JM103 is acvailabie from J. Messing (Univ. of Minnesota). Anottier E. coO strain %vitti ttie 
characteristics of E. ^ strain JM103, l.e., JM101 , can be obtained fl^m ttie American Type SiRure Coflection. 
Roclcvilie. Maryland, under accession number ATCC 33876, and can also be employed as host About 400 
ampldllln resistant colonies per ml were obtained and of 98 colonies tested on medium containing X-gal. 95 65 
were white Indicating ttie auccessful insertion of a foreign DNA firagment between ttie Hindlll and BamHl sites 
on the vector. 

Plasmids from Individual transfomiant colonies were prepared t>y a small scale procedure (BImbdm and 
Doly, Nud. Acid Res., 7. 1513, 1979) after amplification of ttie plasmids by addition of spectinomycin (150 
)ig/ml) to the growing "cultures. eo 

The recombinant plasmids were analyzed on 7.9^ acryiamide gds after double digestion witti Avai l and 
BamHl endonudsases and compared to the 450 bp A^D^iai fragment comprising the promoter-N-tennlnus 
coding region of pRIT10164 and to ttie fivgments of a HesII digest of pBR322 DNA. An Avall -BamHI fiBgment 
conrespondlng to deletions of from 20 to 90 base paire was present In 35 of 38 plasmids examined when 
conq>ared to pRrn0164. Three plasmids representing deletions of about 80 base paIre (pRm0166) 50 base 66 
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pairs (pRIT10ie7-RgurB 8) and 45 base pairs (pRITIOieS) were chosen for further study. Plasmid DNA was 
prepared by CsCI-ethldlum bromide density centrHugatlon and 25 jig of each digested with EcoRI. The EcoRI 
extremities were (abetted with y-^^P-ATP by exchange Wnatlon and the nucleotide sequence of the 
Hlndlll^coRI fragments from each plasmid determined by the chemical modification methods of Maxam and 

5 GiibertliSeStods In Enzymology. 65. 489. 1980). LabelDng of and sequencing from the ^oRI site In the pUM 
vector portion of each recombinant plasmid is a convenient means of determining the sequence around the 
adjacent BamHI site marldng the end of the deletion Into the TDH3 DNA fragr ment This sequendng analyste 
showed tfiillrrplasmid pRfTI 01 66. the BamHI site Is recreated at a G residue located In the non tran^ted 
region 25 base pairs upstream of the ATQ codon; in pRin0165 the BamHI site Is located at the second base of 

10 the third codon; and In pRm0167 the BamHI site is located at the G residue of the ATG codon. 

The Hindlll -BamH I fragment of TDH3 DNA thus constructed In pRlTIOie/ contains an the sequences 
necessary for TDH3 promoter activity in an unaltered form. 

B campie 2. Construction of vector pRin0172. 

The 1060 bp Hindlll-BamHI TDH3 DNA Insert from pRrri0167. prepared as described in Example 1, was 
recloned ontd th'sYEplSshutfle vector described by Broach et ai. (Gene. 8, 121 . 1979) between tfie Hindill site 
In ihfl g micmn DNA portion of the vectorand ttie BamHI site to give the recombinant plasmid pRIT12159. DNA 
of pRIT12159 was ttien digested with Xbal and BamHI endonucteases and the resulting 1650 bp fragment was 
ligated in place of a similar Xbal -BamHI fragment contrining the arg8 promoter on plasmid pRm0774. Plasmid 
PRIT10774 is described by cSezonit al. (Proa Nati. Acad. Sd. U.S.. 81. 6594-6598. 1986), and comprises the 
YEp13 repGcon from which ttie natural vector's BamH I and )aiol sites have been deleted by In vitro 
manipulation togettwr witti a 1470 bp Hindlll-BamHI insert comprising ttie arg^ promoter region and a 1150 bp 
BamHI-Hlndill fragment comprising ttie argS^Binscription termination region. Replacement of tiie arg» 
mwnoterinsert on pRrri0774 by tiie TDH3 promoter insert gives plasmid pRIT10172 which tiierefore contains 
a single BamHI site located at tiie ATG codon of ttie TDH3 promoter insert and proceeding a functional signal 
for transwpon tennlnaflon on the 1150 bp BamHI-Hlndl il ai^ DNA fragment Foreign DNA can tiierefore be 
cloned at this site. Furthermore ttie two DNA inserts are bounded by Hindill sites enabfing removal of tiie 
expression unit module (cassette) as a 2200 bp Hindill fragment for Insertion on other vectors. Figure 9 is a 
restrtction endonudease cleavage map of pRnn0172. 

Example 3. Constnjction of pRm2290. ^ ^ . , - ^ 

A 1050 bp HIndlil -EcoRi fragment from pRrn0167 (F^ 8). prepared as described In Example 1, and 
containing ttie"HindIil -BamHI TDH3 promoter fragment togetiierwitii tiie BamHI-Sn^^ 
polylinker was redonedbetvireen ttie Hindi ll and EcoRI sites of a pBR322 derivative plasmid previously dieted 
fro tfie BamHI site by filling In wtth T4 DNA polymerase and religation. The resulting recombinant plasmid was 
identified as pRm2176. . ^, _ 

Rasmid DNA of pRIT 10158 (Rgure 8), prepared as described in Example 4 (b), below, was digested wrth 
EcoRI endonuclease and treated witti T4 DNA polymerase to fin In tiie EcoRI single strand extensions. This 
preparation was further digested wItti Cfal endonuclease. A further sample of pRm0158 DNA was digested 
40 wfth Clal and Haeill endonudeases and a 1150 bp ClalHaeill fragment canying ttie transcription termln^n 
redolTof tiie gene was recovered by prepsunattve agarose gel electrophoresis and electroeiutton. This 
fragment waslgated wHh ttie EooRi, T4 DNA poly merase. ^ treated pRFTIOISS DNA. and ttie Dgatlon 
mixture was used to transform cofSpetent ceDs of E, con strain MM294 prepared as described by Cohari et al.. 
above, to amplcDlln resistance. From ttie transfonned colonies, a plasmid. kJentffled as pRm0162. was 
45 isolated In which tiie ClaWaeiii arg^ transcription temfiinaflon fragment was Inserted on pRm0158 between 
tiie Clai and filled in E^i sites and In which tiie EcoRI site has been recreated. Figure 8 Is aflow sheet wtMi 
fllusBSes ttie preparation of pRIT10162. Plasmid DNA of pRIT10ie2 was digested witti E coRI and 
endonudeases. mixed witti plasmid DNA of pRm2176, prepared as described above, (also digested with 
EcoRI and Psti endonudeases) and the mixture was llgated and used to transfonnri ceils of E. coMK12 strain 
SO VS^^ to arnpldflln resistance by tiie methods of Cohen et aL, dted above. A plasmid. pRrri22D8. vras 
recovered from a cdony from tills transformation In which tiie large 4630 bp EcoRI -Psti fragment frorn 
pRm2176 has been Ogated to the smaB 1930 bp EcoRIPsti fragment from pRm0162. and in which tiie arg^ 
transcription tennlnation region Is now Juxtaposed to ttieTDH3 promoter.The arg^ and TDH3 DNA regions can 
be excised from pRm2208 as a single 2200 bp fragment by digestion witii Hindill endonudease. To obtain tiie 
SS otiier orientation of tiiis fragment witti respect to vector sequences ttie 2200 bp Hindill figment frorn 
pRrT12208 was redoned Into ttie Hindill site of a pBR322 derivative plasmid In wWch ttie natural EcoRI and 
BamHI sites had separately been deleted by filling in witti T4 DNA polymerase and reOgation. The resulting 
ptomid, wltti the 2200 bp Hindi ll fragment Inserted in ttie opposite orientation witti respect to vector 
sequences compared to pRrn2208. is identified as pRIT12290 and deposited as ATCC 67186 on August 18, 
60 1988 at the ATCC In accordance wltti Budapest Treaty's regulations. Rgure 10 is a restrtction endonudease 
deavage map of pRrn2290. In botii ttie pRIT12208 and pRrT12290 vectors, ttie TDH3 promoter and ag^ 
transcription tennlnation fragments are separated by unique BamHI . Smal and EcoRI restriction sites useful 
for the doning of DNA fragments canying coding sequences, and the promoter and transcription tennlnation 
regions can be exdsed togettier on a 2200 bp Hindi ll fragment as a module or cassette for transfer to otiier 
65 vectors. 
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The BamHl stte Is espedalty useful as it Is located at the ATG codon of the TDH3 gene and can t>e employed 
for fusion of other genes to the TDH3 pixmtoter wfth a view to expressing them In yeast as mempllfied below. 
Such fusions can be recovered from pRnri2208 or pRm2290 derivatlviis fin the iom of cassettes or modiides 
for insertion on yeast shuttle vectors by digestion with HIndi ll endonuclease or by the use of other restHctioh 
endonudeases cutting at flanldng restriction sites. 5 

Example 4. Construction of piasmlds pt)iT10677, pRIT10909 and pRm0158. 

a) pRm0677 

A 1372 bp BamH l fragment of doned HBV DNA was excised from pRrT10616 (Harford et al., Dev. Biol. 10 
Standard. 54. 12&-130. 1983) and mixed and llgated with BamH l endonuclease digested pBR327 DNA. This 
BamH l fragment contains part of the HBsAg precursor region, the HBsAg coding sequence and 3^ non coding 
sequences. From the llgationmixtureaplasmld.pRmoe77.wasselected which was thb HBV BamHl fragment 
Inserted at the BamHl site of the vector In the oriental 

the Sail site In the pBR327 vector. Figure 8 Is a flow sheet which Illustrates the preparation of pRrn0677. is 

b) pRm0909 

The 3300 bp Hlndl il fragment from ptasmid pMC200 (available from the American Type Culture Collection. 
Rockvllle, Maryland. U.SA. without restriction under accession number ATCC 39131), described by Crabeei 
M. et al (EMBO J., g, 205-212. 1983). was recloned onto a pBR^ derivative plasmid on which th6 natural 20 
EcoR I stte had been deleted by fQllng In with T4 DNA polymerase and religatlon. A resulting recombhant 
plasmid. pRfTIOISd (Rgure 8). was selected In which the HIndill aig9 gene Insert has the 3^ region of the arg^ 
gene proximal to the Oal site on the modified pBR322 vector. From pRIT10158, a 1 150 bp C^l Haelli flrBgmerit 
was recovered which canies the transcription temilnation region of the arg^ gene. This 1 150 bp fragment was 
llgated with and Hpal endonudeases digested DNA of ptesmid pRm0677. (prepared as described In Part SB 
a). at>ove, to fuse the aQp termination re^on at the Hpal sita downstream of the HBsAg coding region. The 
resulting plasnrid te pRm0909. Rgure 8 1^ aflow sheet wMch iliustratas the preparation of pRm0909. Figure 3 
is a restriction endonuclease cleavage map of pMcaoO. 

Example 5. Construction of piasmlds pRrT10911 and pRm0912. 36 

The BamH l site on pRITlOW (Example 1) and the natural BamH l site located in the precursor region of the 
HBsAg gene from a HBV virus of adw serotype cloned on pRfTIOBie (Harford et al.. Dev. Biol. Standard, 54. 
125-130. 1983) are In the same transiationai reading frame enabling the construction of a fusion of the TDH3 
promoterfragment which supplies the ATQ codon to 42 codons of the HBsAg precursor sequence followed by 
226 codons of the H BsAg coding sequence proper. The source of the HBV DNA fragment for this fusion was a 3$ 
plasmid pRIT10909 (see. Example 4, part b, above) containing a 935 bp BamHIHpal Insert encoding part of the 
HBsAg precursor region, the full HBsAg coding sequence and 128 bp of d' non translated DNA to which had 
been fused, downstream of the Hpa I site, an 1150 bp Haelll-CIaill DNA fragment thxn pRm0158 containing 
the argS transcription temdhatlon leglon. pRm0158 la described In Example 4, part B, above. The 2086 bp 
EcoRI-BamHI fragment was excised from pRnri0909 and inserted between the EooRi and BamHl sites of ^ 
pRrri0167, prepared as described In Example A, so as to create plasmid pRm0911. The 3135 bp Hlndlil 
fragment firom pRmooil containing the yeast DNA transcriptional signals and the HBsAg coding aequencee 
was then inserted on the YEp13 shuttle vector to give plas- mid pRIT1091Z Figure 8 la allow sheet iOustniting 
the preparation of pRrT10911. 

46 

Example 6. Construction of piasmlds pRIT10903 and pRn'10914 

The aim of the constructions described below was to remove excess nucleotides from the 5' end of a DNA 
fragment encoding at least the N-termlnai portion of the mature HBsAg coding sequence In such a fashion as 
to create a 6 base pair restriction site at the first base of the second codon. The resulting fragment can then be 
fused to promoter fragments with a 6 base pair restriction site at the Initiation codon using Mung Bean or S1 60 
nucleases to trim away the single strand extensions and create blunt ends for ligation as shown by Rosenberg 
et al.. Methods In Enzymology. 101C . 123 (1983). 

A 12^ bp FnuDi l fragment of HBV DNA from pRIT10616 (See. Harford et al. (1983) Devei. Bid. Standard, 54, 
125-130) and encoding the HBsAg gene was excised from this plasmid and llgated with synthetic octomer 
EcoRI nnkere, and the fragment cloned Into the EcoRi site of vector pACYC184 to give pRIT10679 (Figure 8). S6 
Vector PACYC184 was received from S. Cohen and is described by Chang A.C.Y. and Cohen S. (1978), J. 
Bacterid,. 134. 1141- 1158. pACYC184 Is also available from the American TVpe Culture Coflectlon. RockvWe. 
Maryland, U.SA. under accession number ATCC 37033. From this EcoR I (FnuDII) fifagment, a 125 bp 
EcoRI-Xbal fragment was obtained which contains the Ctenmlnal region of the HBsAg precursor, the HBsAg 
ATQ codon and 90 bp of HBsAg coding sequence. This 125 bp EcoRI-Xbai fragment was Introduced between eo 
the EcoR I and Xbal sites of pRrri0158 (see. Example 4 (b), above). The resulting plasmid, pRm0903. 
therefore contains a sin- gle EcoRI site located at the Junction between ar^ DNA and HBV DNA and has a 
single )g}ol site located in thearg^ promoter region. Rgure 8 Is a flow sheet which IDustrates preparation of 
pRm0903. 

150 pg of pRrT10903 plasmid DNA, prepared by CsO^idlum bromide density gradlertt centrtfugation, was 66 
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digested wtth 80 units of EcoRI endonudease, extracted with phenol, ethanol predpltaled and resuspended 
at 1 iifl per ulln 10 mM tromiftamine-Ha buffer (pH 7-5). The DNA was divided Into three 50 jig saniples and 
Incubated for 50. 75i?id 90 seconds with BaOl nuclease at 30^. Each reaction mixture contained the same 
reaction bufferclted in Example 1. 50 jig of EmRI digested pRrri0903 DNA. and 2.5 units of nuclease in 
5 afinal volume off 500 uJ. The reactions were stopped by addition of EQTA to 20 mM final concentration. chWIng 
on ice. extraction with an equal volume off phenol and precipitation with etha noi. ,5!^?'^^^^ 
samples were each taken up In 50 |il10 mM tromethamlne-Ha buffer and 23 iig digested wWi Bst^H 
endonudease and compared on a 7^ acrybmlde gel to an EcoRI plus Bsffll endonud^ digest of 
pRm0903whlch releasesa285 bp fragment, Treatmentwith topi nuclease for 75 seconds at 30» vrasfound 
10 to reduce the size of this EcoRIBstEII fragment by 10 to 60 base pairs gMng a series of discrete bai^ 
estimated to represent removal of 10.30.45 and 60 base paire of DMA. Resection of 29 base paire from the 
original FnuDII site would remove all the HBsAg precursor DNA and the HBsAg ATQ codon. 

6 ug ofpRiT10903 DNA viWch had been treated with BaOl for 75 seconds vras digested with 8 units of »ol 
nuctose. extracted virtth phenol and ethanol predpitated. This DNA was then Incubated with 5 unite oJT4pNA 
IS polymerasa In the presence of deoxynudeotlde triphosphates to fill In and render flush ended the )giol and 
^1 extremities, treated virith phenol and ethand precipitated. The DNA was then Incubated v»rtth 5 unite ofT4 
dFSA llgase fbr 16 houre at 16" and the mixture used to transfonn competent cells of E. coH K1 2 strain MM294 
to ampldflin resistance accoitling to the method of Cohen at al.. cited above. About 2000 ampldDIn resistant 
colonies per ml were recovered after pla^ Plasmlds were pr^jared firom 47 Individual colonies ^ a smaU 
20 scale procedure (See. BImbdm et al, dted above), and 23 of 47 were ftound to retain aJOwj stte Indl^tlve 
successful flIOng In of the original )giol ^ and llgaflon to a y end terminating In a Q residue. These 23 
plasmlds vrare further digested with )Oiol and >toal endonucleases to determine the size of the small restriction 
franmBnt In comoarison with the origired EcoRI-Xbal fragment from pRIT10903. 
Several plasmlds were Identified with deletions estimated to range from 25 to 40 base pafrs. One plasnud. 
25 pRm0914, which was estimated to contain a XhoiXbal fragment of 95 to 100 base paire was selectwl for 
further study. Figure 8 te a flow sheet which Dlustrates preparation of pRITIOOU. DNA of this plasmid v^ras 
purified tv CsClethidkim bromide density gradient centrifugatlon and 25 jig of such DNA was digested viffth 
140 unite of Xbal endonudease. u. ^ 

The Xbal termlnfl were exchange labeDed v*rfth y-»2P-ATP. and the labeDed DNA was digested with 1 5 unite of 
30 HIndlll aiaonudease. The smaH 765 bp Hlndlll>Xbal fragment was Isolated by acryiamWe gd electrophoresis 
iRdelectrodutlon fdlowved by ethand precipitation. The nucleotide sequence at and around the 2?!o[^e vras 
detemiined by sequencing this fragment firom the labdied )M terminus using the chemlcd modification 
methods of Maxam and Gilbert, dted above. The sequence date showed that the 20101 site was located at the 
first base of the second codon of the HBsAg coding sequence. 



45 



Xhol 

CTC6AGAAC 
^ HBsAg aucleotides 



This fragment Is therefore suitable for fusion to the BamHI extension of the TDH3 promoter fifagment on 
pRmOIGT (Example 1) after removal of single stranded ends. 

Ex ample 7, Construction of plasmlds pRrT12211. pRrri2a09, pRrT12230, and pRm22K 

Plasmid pRm0911. prepared as descn*bed In Example 5, was used as a steaffold for further manlpulallons. 
To Introduce desirable restriction sites, fiie plasmid was digested with BamH I and )^ endonucleases, and 
the exd^d fragment replaced by a 2275 BamHI-Xbal fragment firom plasmid pRFTIOISO (Example 4). This 
50 manipulation resufte In plasmid pRm2211 In which the HIndlil -BamHI TDH3 promoter fragment Is next to a 
BamHI-Xbal fragment containing a )giol site fdlowed by an Xbal-Hlndlll flragment comprising the C-termir^ 
rigi^ oTthe HBsAg coding sequence and 128 bp of ^ non coding DNA fused to the 1150 bp ag^ 
transcription ternilnation region. Figure 8 Is a flow sheet virhlch illustrates preparation of pRIT1221 1 . Plasmid 
PRIT12211 was digested virtth Xhol and »al endonudeases tiiereby removing a 1600 bp fragment which was 
55 replaced by the 94 bp XholXbiSfragment of modified HBsAg DNA from pRin0914. prepared as described In 
Example 5. This 94 bp ffiigment vras purified by acrylamlde gd electrophoresis from a )Oiol plus )toal digest of 
pRrT10914 DNA followed by dectrodution of the appropriate gd sDce and recovery of the DNA by ettjanol 
predpltatlon. „ . 

The plasmid, pRIT12209. resulting from the introduction of tiie 94 bp fragment (See. Rgure 8), vim punnoo 
60 by CsCI-ethldlum bromide density centrtfugation, and 50 ^g of pRiT12^ DNA was digested vwftf> unite of 
BamHI endonudease and 60 unite of Xhd endonudease to remove tiie 990 bp of extraneous DNA oe^een 
theTDH3 promoter and ttie HBsAg coding region, and then was phenol extracted and ethanol predpitated. 16 
ua of this DNA was Incubated for 30 minutes at 30* wrth 20 unite of Mung Bean nuclease (PL Blochemicals) in 
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a volume of 125 |U. The buffer used for the Incubation was 30 mM Sodium acetete (pH 4.6). 250 njM Nad, i mM 
ZnCb and m glycerol. The reaction was stopped by the addition of sodium dodecyl sulphate to gh» 0^ 
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final concentration and extraction with an equal volume of phenol followed by ethanol predpttatlon. An aliquot 
of 0^ }xg of this Mung Bean nuclease treated sample was lncut)ated with 2 units of T4 DMA ilgase for 16 hours 
at 16^ and then digested with 2 units of BamH I endonuclease to destroy any vector molecules which had 
escaped the above treatments. This mixture was used to transform competent cells of E ^ K12 strain 
MM294 with selection for amj^dllin resistance according to the method of Cohen et al., dted at>ove. About 5 
2000 ampicDIin resistant colonies per n)l of transfonnatlon mixture were recovered and plasmlds were 
prepared by a small scale procedure (according to the method of Bimbolm et al., dted above) firom ampOflsd 
cultures of 24 of these trensformants. 

16 of 24 plasmlds gave two bands of ONA with sizes to 2700 bp (pl!C9 vector) and 3000 bp after digestion 
with HIndl ll endonuclease. The 3000 bp band Is the size expected for the fusion of the HBsAq coding sequence 10 
and arg^ temilnatlon region to the TDH3 promoter fragment. 

Four candidate plasmlds were taken for further study. DMA of each plasmid was digested with Xbaf 
endonudease and labelled by exchange Idnotlon with y-^^^ATP followed by dgestlon with HIndl ll 
endonuclease and isolation of the labelled 119Q bp Hlndi ll-Xbai fragment tiy electroelutlon and ethanol 
predpitation following acrylamlde gel electrophoresis. The sequence of each fragment was determined by the is 
cfiemical modification methods of Maxam and Gilbert, dted above. TWo plasmlds were found to have the 
correct sequence ATQQAQAAC for perfect fusion of the TDH3 promoter region to the HBsAg gene. One of 
these plasmlds, pRm2230. was used bi further manipulations. Figure 8 Is a flow sheet which Illustrates 
preparation of pRIT12230. DNA of this plasmid (pRrn2230) was digested with HIndlll endonuclease and the 
3000 bp fragment containing the HBsAg coding sequence fused to the TDH3 promoter was ilgated onto SO 
shuttle vector YEp13. The resulting plasmid, pRIT12266, was transformed Into yeast strain DCS (MATa, ieu2«3, 
Ieu2--1 12, hl33, can1-1 1 ) (otytalned from J. Broach (State University of N.Y. Stony Brook) and avallable without 
restriction from the American Type Culture Collection under accession numt>er ATCC 20630) using the 
transfbnmatlon procedure of Ito et al.. J. Bact, 163, 163 (1083). 

2S 

Example 8. Construction of plasmlds pRlT12288, pRm2322 

The construction on plasmid pRIT12230 (Exarnple 7) contains 128 bp non coding HBV DNA located 2f 
downstream of the stop codon of the HBsAg structural gene. 

To remove the 128 bp of DMA, about 25 |ig of pRrri2230. prepared as described In Example 7 at>GV8, Is 
digested with Acd and EcoRI endonudeases. pRrn2230 contains a single Acd site located In the S-gene 30 
coding sequence 7 nucleotides before the TAA stop codon. The digested DNA was electrophoresed on a 1<M> 
agarose gel and the larger 4200 bp vector fragment was recovered from the gd by electroelutlon and ethanol 
predpitation. 

Artlficlal DNA linker molecules composed of a 12-mer strand and a 14Hfner strand with the fdlowing 
sequences are synttieslzed for Insertion between the AccI and EcoR I sites of pRm2330: 35 

5" ATACATTTAACG 3' 

12 met ^ 
3 • TGTAAATTGCTTAA 5 • 
14 mer 

This will restore the correct C-termlnal HBsAg codons, the TAA stop codon and provide an EcoR I site not ^ 
present elsewhere In the TDH3 promoter of HBsAg coding sequences for addition of transcription termination 
fragments. 

100 pmoles of the synthetic 12 mer and 100 pmoles of the synthetic 14 mer dngle strands are mixed 
together in a final volume of 10-20 |il, heated at 70* for 15 minutes, and diowed to return dowly to room 
temperature over a period of 3 hr permitting the anneding of the two strands dong thdr complementary ^ 
sequences. To this anneded mixture Is added about 0.15 pmole (400 ng) of the 4200 bp Acd-EcoR I fragment 
of pRm2230, IQx concentrated ligation buffer and 2 units of T4 DNA Ilgase. 

This mixture is incubated for 4 hours at 16^ before the addition of a further 2 units of T4 DNA ligasa and then 
Incubated ovemlght on ice. The Ilgated mixture Is used to transfomn competent cells of strain MM294 to 
ampidnin redstance according to the method of Cohen et d. (Proc. Natl. Acad. Sd., U.SA, 2110. 1972). ^ 
Plasmlds are prepared from 12 or more IndMdud colonies by the rapid procedure of Blmt)0}m and Doly (Nud. 
Adds Res.. 7, 1513. 1970) and examined by restriction endonudease analysis. Plasmlds with Insertion of the 
Acd to EcoR I linker will stiow a single band of DNA of 4200 t>p on digestion with either Acd or EcoRI enzyme. 
The correctness of the linker insertion on such a plasmid can be verified by DNA sequencing from either the 
EcoRI site or Acd site by the chemlcd modlflcatfon method of Maxam and QObert (Methods In Enzymofogy, 
es, 489, 1880). 

To provMe a transcrlptfon tenmlnatfon fragment, a plasmki with the Acd-EcoRI linker insert, obtdned as 
described above, is digested with EcoRI endonuclease end treated with dkdlne phosphatase (Shine J., U.S. 
Patent 4,284,731). The 2650 bp HIndlll fragment of pRrn0162. obtdned as described In Example 3. was 
recloned In the HIndl ll site of the pBR^ vector plasmid to forni plasmki pRm2288. pBRd27 Is described In 
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Soberon et aJ. (Gene, 9. 287-305, 1980), and can be prepared as described In Example 10, part b, Infra, from 
plasmid pRrri2309 which can be recovered from the American Type Culture Collection under accession 
number ATCC 67187. Figure 8 Is a flow sheet which discloses construction of pRIT12288. 
The orientation of the Hindl ll fragment on pRrri2288 Is such as to juxtapose the ag^ transcription 

5 termination region with the gcoRI and Oal restriction sites on the pBR327 moiety. From pRm2288, a 1180 bp 
EcoRI fragment b obtained wMch canrtes the ar^ transcription termination region. This fragment is ilgated to 
ttii EcoRI digested, alkaline phosphatase treated pRIT12230 derivative described above, and plasmlds Issuing 
fronr5ie"Bgation mixture are examined for Insertion and the orientation of the 1180 bp EcoRI fragment The 
orientation of the Insert can be obtained by digestion with Hindl ll endonuciease which will Gberate fragments of 

10 2880 and 2720 bp If the Insert orientation Is as desired. 

A plasmld. pRITI 2322. was constructed In which the Acci-EcoRI llnicer replaces the 128 bp of unwanted HBV 
DMA and on which the 1180 bp EcoRI fragment from pRrn2288 was present In the desired orientation, 
downstream of the HBsAg coding sequence. A 2900 bp Hind lll fragment can be excised from pRm2322 which 
carries tiie expression module for HBsAg synthesis Including tite TDH3 promoter, the HBsAg coding 

IS sequence and the an^ transcription termination region. Rgure 8 Is a flow sheet which di^oses construction 
of pRm2322. See, also. Figure 11, wWch Is a restrtetion endonuciease map of pRrT12232 and shows which 
portions thereof come from pRIT12288 and pRm2230. 

E xample 9. Construction of plasmid pRm2329 
a> As described in Example 8 above, pRm2322 contains a 2900 bp Hindlll fragment wtth all the necessary 

signals for expression of HBsAg from the TDH3 promoter This module was Bgated onto a shutUe vector for 

Introduction into yeast cells. 
DMA of shiittie vector YEp13 was digested with Hindl ll endonuciease and allcaline phosphatase and used as 

recipient for the Introduction of the Hindlll frragment from pRrT12322, prepared as described in Example 8. 
25 Plasmid pRin2329 was Isolated which consists off the YEp13 replicon together witti the 2900 bp Hindlll module 

firagment described In Example 8. 

Example 10, Construction of plasmid vectore for the expression of the hybrid antigen HBsAg-CS In yeast 

30 A. Construction of Plasmlds pRm2573 and pRn'12S74 

Plasmid WRZOI was obtained from the VWter Reed Army Institute of Research and results from insertion of 
a 2.3 kOobase (kb) EcoR I fragment from XmPfl (Dame et al.. Science, 593-599. 1984) encoding the 
complete circumsporozorte protein gene of Plasmodium fialciparum into vector pUC8, Rgure 2A shows a 
schematic restriction map and sequencing strategy of done XmPfl. The positions of restriction enzyme 

^ cleavage sites shown In the figure were determined from the sequence and conflnned by digestion: A, Avail; 
Ac. Acci; B. Bstnl; D, Oral; Dd, Ddel; F, FoW; N, Ndei; R, Rsal; S, StuI; T. Tthlll; Tq. TaqI; X. Xholl. Arrows 
Indicate ttie origin, direction and extent of the sojuences determined. The CS protein coding region Is shown 
as a heavy line. Rgure 3A shows the nucleotide sequence off the CS protein gene frt>m P. falciparum. The 
nucleotide sequence off the CS protein gene in XmPfl is shown. Vector pUC8 Is described by VIeIra et al. (Gene, 

40 19, 259, 1982). pUC8 Is commercially available from Amereham and Phanriacte. A 1215 base pair (bp) SbjI-Rsal 
fragment of plasmid WR201 containing the CS protein cocflng sequence minus the first 52 bp (Figure 12) vras 
purified by electroelution from a 7.6% polyacrylamide electrophoresis gel. This frragment was digested witii 
Sau3A to generate smaller frragments whteh Included, among others, a 192 bp Sau3A fragment coding ft>r 16 
tetapeptide repeats off tiie CS protein and ttiree Identical 24 bp Sau3A fragments coding for 2 tetrapeptide 

45 repeat of the CS protebi. 

Plasmid pRIT10911 (prepared as described In Example 5) Is a pUC9 derivative canying a 3136 bp Hindlll 
cassette containing a 1050 bp HIndiii -BamHI yeast glyceraldehyde^-dehydrogenase (TDH3) promoter 
fragment (which supplies the ATG) fused at the BamHI site to a 935 bp BamHI -Hpal flnagment of HBV DMA 
enconding the C-termlnal 42 amino adds (aa) of the pre-S2 region. 228 aa of tfie S gene (serotype adw) and 

SO 128 bp of y non-coding DNA, The S gene is flanked by a 1150 bp yeast DMA fragment carrying the arg* 
transcription tennlnator. pUC9 is described by VIeIra et aL (Gene, 19, 259-268. 1982). The BamHI site of 
pRm 091 1 , boated at the ATG Initiation codon. Is In jrfiase wfth the Sau3A sites of the repetitive epitope of CS 
of Plasmodium falciparum. DNA of plasmid pRrn0911 was digested wtth BamH I endonuciease, treated wfth 
alkaline phosphatase, extracted wfth phenol and recovered by ethanol preciprtation. 0.2 jig of this DNA was 

SS mixed wfth 0.2 Jig of tiie SauSA digest off the Stul-Rsal CS gene frragment prepared as described above, treated 
wtth T4 DNA Ogase, and the ligation mixture was used to transform competent ceUs of E. toD K12 strain MM^ 
prepared according to the metiiod of Cohen et al. (Proc. NaU. Acad. Sd., U.SA, ra, 2110, 1972). 32 plasmlds 
from Individual transfonnant colonies were prepared by a small scale procedure (Bimbobn et aL. Nucleic AcWs 
Research, 7, 1 51 3. 1979) after amplification of tiie piasmid by addftlon of spectinomydne (300 |ig per ml) to the 

60 growing cuftures. The recombinant plasmlds were analysed on 7J5f^ acrylamlde electrophoresis gel, after 
double digestion wfth BamH I and Xbal endonucleases. and were compared to ttie 219 bp BamHI-)*ai 
fragment of pRrT10911 (comprising the first 42 amino add coding sequence of the Pre S2 region precursor 
and the beginning of HBsAg region) and to the frfagments of a Haelll digest of q)x174 DNA. 
Among tiie 32 plasmlds. one recombinant showed a BamHDCba frBgment of 411 bp which coffesponds to 

6$ tiie Insertion of the 192 bp fragment Sau3A In the oonwl orientation and was designated pRfTI 2572. Another 
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recombinant showed a BamH I-Xbal fragment of 243 bp conrespondlng to the Insertion of the 24 bp SauSA 
fragment In the conrect orientation and was designated pRm2571. Rgure 12 Is a flow sheet Illustrating 
preparation of pRm2571 and pRrri2572. 

The Hindlll fragments of pRIT12572 and pRIT12571. containing the TDH3 promoter, the C&-HBsAQ hybrid 
coding sequence and the arg3 termination region were recloned onto YEp13, to give, respectively, plasrryds 5 
PRIT12574 and pRIT12573 (Figure 12). YEp13 Is described by Broach et al. (Gene. 8. 121-133. 1979). Figure 12 
Is a flow sheet niustrating preparation of pRIT1^73 and pRrn2574. " 

Plasmlds pRn'10912, pRn'12574 and pRin2573 were Introduced Into S. cerevislae yeast strain D&6 (a, 
leu2-3, lei^-112. hls3. ^1-11). and Into a cerevisiaB yeast strain 10S44C (pep4-3 lm^-3, tei^-1 12) obtained 
from M. Crabeel located at the Ceria Institute, Anderiecht, Belgium, (see, also Cabezon. T. et al., Proc. Natl. io 
Acad. Sci. \J.SA„ 81 , 6594-6598. 1984) by transfomnation of IHhIum acetate treated cells (Ito et al., J. Bacterid., 
153 . 163-168, 1983) and selection for leucine independence. Yeast strain DCS was deposited in the American 
Type Culture Collection, Rockville. Maryland, U.S.A., In accordance with the regulations of the Budapest Treaty 
on June 2, 1982, under accession number ATCC 20630. Yeast strain 10644C was deposited In the American 
Type Culture Collection. Rodcville, Maryland. U.S A, In accordance with the regulations of the Budapest Treaty 15 
on August 18. 1986. under accession number ATCC 20819. The expected hybrid protein for pRIT12573 Is 
predicted to contain 277 amino adds, and for pRm2574 is predicted to contain 333 amino adds. 

It is also possible to add to or enlarge the OS epitope region on the vector of the invention. For example, on 
plasmlds pRrT12572 and pRrn2G7l. additional SauSA fragments of CS protdn DNA, e.g.. 182 bp or 24 bp 
Sau3A fragments, can be Dgated In to the BamHI site as follows: 20 

Piasmid DNA of pRm2572 or pRrri2571 Is digested with BamH I endonuclease and treated with altcallne 
phosphatase. The 1215 bp Stul-Rsal fragment of plasmid WR201, obtained as described above, Is digested 
with Sau3A endonuclease to liberate the 192 bp and 24 bp fragments bearing the tetrapeptide repeats. These 
fragments are mixed with the BamHI and aOcallne phosphatase treated pRtt12572 or pRm2571 vectors, 
treated with T4 DNA llgase and the mixtures transformed into E. toII K12 cells according to conventional 25 
procedures with selection for ampldliin resistance. Plasmlds from transformant cdonles are screened for the 
size of the BamHI"Xt)ai fragments endonudease digestion and compared to similarly digested DNA of 
pRlt12572 or pRm2571. especially the 411 bp and 243 bp BamHI-Xbal fragments from pRIT12S72 and 
pRtt12571 which carry the fusion between the CS repeats and part of the pre 82-8 coding sequence. Increase 
lntheslzeGfthesefrBgmentsby192bpor24bplsMicatlveofthein86rfonofatBtrBpe 30 
at the BamHI site of pRm2572 and pRm^1 . The presence of a BamHI site on such ptasmlds is Indicatlva of 
Insertion of the 192 bp or 24 bp Sau3A fragments In the correct orientation for ftjslon at the ATQ codon. This 
process descrit>ed atiove may be repeated if desirable to introduce further 192 bp or 24 bp Sau3A fragments 
encoding the CS tetrapeptide repeats and elongated the CS tetrapeptide region on the hybrid partlde. As 
stated before, the Hindlll fragments from such derivatives of pRn'12572 or pRm2571 carrying the expression 35 
module or cassettes may be excised and inserted on YEp13 or other suitable yeast doning vector fbr 
Introduction Into yeast cells k>y conventional tecfmiques. 

(B) Construction of pRm2309, pRrT12314 and an expres^on shuttle vector pRm2377 containing the 
complete 2 micron DNA sequence from yeast 40 

The piasmid pRIT12309 consists of the complete 6318 bp 2 micron DNA sequence from yeast doned Into 
the EcoRI site of the piasmid vector pBR327. The 6318 bp 2 micron DNA sequence was obtained from piasmid 
pCV19 from J. Broach, Princeton University, NJ. Piasmid pRm2909 was deposited on August 18. 1986 in the 
American Type Culture Collection, Rockville, Maryland, U.SJL, In E. odi K12 strain MM924 In accordance with 
the regulations of the Budapest Treaty under accession number ATCC 67187. Rgure 13 is a restriction 45 
endonuclease map of pRn'12309. Insertion on pBR327 of the 2 micron DNA sequence through one of the two 
EcoR I sites present on this mdecule Intemipts ttte 2 micron A coding region and rendere the resulting hyt>rid 
molecule pRnri2309 ineffective for A gene function, also refen-ed to as FLP function. For a description of 2 
micron structure and function see Broach J., "The Molecular Biology of the yeast Saccharomyces : Ufa cyde 
and Inheritance", pp. 44&470. Strathem J.N., Jones E.W. and Broach J.R. (Eds). Cold Spring Hartxr 50 
Laboratory, New Yort( (1981). 

pBR327 is described lay Soberon et al. (Gene. 9, 287-305, 1980) and was obtained from F. Bolivar 
(Department of Molecular Biology. National University of Mexico, Mexlco.20DF). ft will be appredoted by one 
sidllsd in the art that pBR327 can be recovered from piasmid pRIT12309 above by prepttring pRrn2309 
piasmid DNA recovered from the E. coll strain deposited in the American Type Culture Cdiectlon under the 55 
accession AJCC 67187 by any of a nwnber of conventional methods, digesting this DNA with EcoRI 
endonuclease, iigating the digest and transfonnnlng competent cells of E. toH K12 to an^McHlln or tetracydine 
resistance with the ligation mixture. A substantial number of the transformant colonies wlfl contain plasmlds bi 
which only the pBR327 moiety of pRrri2309 Is present devoid of either of the two 2 micron DNA fragments. ' 

The 2850 bp Cial-Sall fragment containing the TDH3"arg^ expression cassette and flanking pBR322 €0 
sequences from pRm2290 (described In Example 3, above) was redoned onto the pRrT12309 vector between 
the Clai and SaU sites. The resulting piasmid, pRm2314, was digested with SaJI endonudease and Qgated with 
the 2218 bpSall-Xhoi fragment from YEp13 containing the yeast LEU2 gene. Rgure 14 Is a flow sheet 
illustrating preparation of pRIT12314. YEp13 is available fkrom the American Type Culture Collection under 
accession number ATCC 37115. The ligation mixture was used to transform competent cells of strain JA221 es 
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prepared according to the method of Cohen et al (above) to both leucine Independence and ampIcUDn 
resistance. 

Strain JA221 Is pubOdy available, and was obtained from M. Rosenberg, Smith Kline & French Laboratories. ' 
Inc., Philadeiphia and be obtained from the American IVpe Culture Collection under mession number 
5 ATCC 33875. 

A plasmid, pRm2544. was obtained from a colony from this transformation b wMch the 2218 bp Sall-Xhol 
LEU2 gene fragment was inserted at the ^ site of pRfTI 2314. The orientation of the Insertion Is such so as to 
recreate the Sail site on the side closest to the IDH^fr^ expression cassette. Plasmid pR fTI 2544 is an E. 
coll - yeast shuttle vector with single BamHl and Smal sites located between the TDH3 promoter and a^ 
10 transcription termination regions which are suitable for Insertion of DNA coding sequences to effect their 
expression from the TDH3 promoter. Rgure 14 Is a flow sheet lUustFoting preparation of pRm2544. 

(C) Constmction of Plasmid pRm2658 
The 3328 bp Hindi ll fragment of pRm2574, a YEp13 derivative prepared as described In Example 10, Part A, 

IS was redoned onto a pBR322 A BamH l plasmid (where the BamHl site has been destroyed by fining in with T4 
DNA polymerase) to give pRIT12606 where the BamHl site located at the ATQ initiation codon of the 
CS-HBsAg coding sequence is ur^que (Figure 15). The 2560 bp BamHl-^1 fragment from pRrT12608 
containing the CSHBsAg coding sequence, the arg^ transcription temilnation racoon and fianldng pBR322 
sequence was racloned between the BamHl and Sail sites of pRm2544 (Rgure ^S^. described above, to give 

20 plasmid pRfTiasSd (Rgure 15). This exchange placed the CS-HBsAg coding sequences under the control of f 
the TDH3 promoter on a complete 2 micron yeast shuttle vector. Rgure 15 is a flow sheet fflustrating 
preparation of pRm2658. 

m 

Example 11, Synthe^ of hybrid protein In yeast 

25 Yeast (Saccharomyces cerevlslae) strains DC5 or 10S44C were transformed according to the method of ito 
et al. (J. BacterioL, m 163-188, 1983) with each of the plasmids pRIT10912 (Example 5). pRm^73 (Example 
10); pRm2574 (Example 10) or pRm2658 (Example 10), or plasmids pRiT12329. (Example 9) and pRrT10172 
(Example 2), and were separate^ grown In liquid medium lacking leucine (YNB or YNB + 80 (ig/ml histidlne) 
and harvested in mld-iog phase. Cells from 1 or 2 ml culture were collected by centrifugatlon and were 

30 reauspended in 50 |U of ai25 M tris44a (pH6.8) containing 2Sm glycerol. 4A/b SDS. 6 urea and ^(m 
Z-mercaptoethanoL After Incubation for 5 minutes (min.) at MXf, the sample was divided In hahres and both 
were eiectrophoresed through a 12.5iVb separating gel, 5iyb staddng gel according to the method of LAemmO 
(Nature. 680. 1971). 

HBsAg and CS protein sequences were identified by protein immunoblotting technique (See, Bumetto, 

3S AnaL Blochem.. 112, 195-203, 1981; Qerahonl and Pelade. Anal. Biochem.. 131. 1-15, 1983; Towbin and 
Gordon. J. Immunol. Methods. 72, S^3r3A0, 1984). After electro- blotting of the proteins onto a nitrocellulose 
filter (Schleicher and SchQD. 0.45 ji) (Towbin et aL. Proc. Natl Acad. Scl., U.S A, 76. 43504354, 1979) the filter 
was preincubated for one hour at ST'C with 3(Vb getettin In PBS. Sut>sequently the filter was washed 5 times 
for 5 min. each time with PBS containing 0.1<W) Twoen 20 (pdyoxyethytene sorbitan monolaurate). and one-half 

40 of the sheet was treated for one iK>ur at room temperature with a mixture of five monoclonal antibodies against t 
CS protein (Dame et aL, Science, S93-S9B, 1984) In PBS. 1<Vb gelatin. The other half of the sheet was 
treated with e monoclonal antibody against HBaAg. HepYTAS12, in PBS, ^W6 gelatin. HepYTAS12 is an 
antibody raised against purified yeast derived HBs/^ which Is directed against a reduction and denaturing ^ 
resistent epitope. The filter sheets were washed 5 times for 5 min. each time with PBS. 0. 1 (Vb TWeen 20, and a 

45 second biotinylated sheep antMnouse antibody (Amersham) in PBS. IQfo gelatin was added for one hour at 
room temperature. The sheets were washed again with PBS, 0.mfo Tween 20, 5 times for 5 min. each time, and 
were incubated for 30 min. at room temperature wltii streptavidin-blotinylated horseradish peroxidase (HRP) 
complex (Amereham). The sheets were washed again 3 times for 5 min. each time with PBS. O.IQfe Tween 20, 
and were finally incubated with 30 \»1 H2O2 and HRP Color Development Reagent containing 4-chlorol-naphtol 

SO (Blo-Rad), 30 mg In 10 ml methanol, In 50 ml PBS. The molecular weight of the antigens was estimated from 
prestained protein maricere. ovalbumin, alphachymotrypslnogen. beta-teictogiobin. tysozyme, cytochrome C 
(BRL). 

Both yeast strains DC5 and 10S44C. transformed witii pRiT12573, showed production of an antigen reacting 
with both anti-CS and anti-HBsAg antibodies, and of an estimated molecular weight of 30,000 kiiodaltons (kd). 
55 Both yeast Strains DCSand 10S44C,trBnsfomted with eitiierpRlT12574orpRm2858 showed productionof an 
antigen capable of reacting with both anti-CS and anti-HBsAg antibodies, and of an estimated molecular 
weight of 37,000 kd. Some lower molecular weight antigen was also detected, indicative of some degradation 
of the hybrid protdn. Control samples of cells containing plasmids wittiout any CS or HBsAg sequences 
(pRn'10172) did not show any reaction. Control samples of Ceils containlong plasmids with only HBsAg 
60 sequences (pRlT12329 or pRrn0912) showed reaction only with the antl-HBsAg antibodies. 

These resuits show that a hybrid protein of the expected molecular weight containing both CS and HBsAg 
sequences is synthesized In yeast transformed with a plasmid containing the CS coding sequence fused in 
phase to the PreS2-HBsAg coding sequence. 
Furthermore, while the band Intensity In the anti-HBsAg reaction was about the same for pRrT12573 and » 
65 pRm2574, the Intensity of ttie anti-CS reaction was much higher for pRrn^4 than for pRm257a This 



32 



0278940 



indicates that for the same quantity of HBsAg sequences there are more sequences of the repetitive epitope 
of the CS protein present in pRIT1^4 than In pRrn2573. 

Example 12. Synthesis of hybrid particles presenting the CSP epitope 

Yeast (Saccharomyces cerevlslae) strains DC6 or 10S44C containing either pRm0172 (Example 2), 5 
PRIT12329 (Example 9). pRIT12573 (Example 10) or pRm2574 (Example 10), were grown In selective medium 
(200 ml YNB or YNB + 80 (ig/ml histldlne) to end of log phase. CeDs were collected by centriftjgatlon and 
resuspended In 10 ml of 50 mM sodium phosphate buffer (pH 7.4) containing OS^ TWeen 20 (polyoxyethylene 
sor bitan monolaurate), 1 mM PMSF (phenylmethylsulfonyl fluoride) and Isopropanol. Cells were 
dismpted by passage twice through a French Press at 20,000 psi (1.38 x io^ Pa). The suspension was 10 
centrifuged for 30 min. at 30 OCX) xg. The total protein concentration of the supernatant liquid (cnide cell 
extract), was measured by the method of Lowry et al. (J. Biol. Chem.. 193, 265-275, 1951) with bovine serum 
albumin as the standard. The presencs ^ HBsAg was assayed using a radlolmmune assay kit (AUSRIA 11) 
commercially avaOable firom Abbott Laboratories, North Chicago, ininols or using an EUSA kit (Enzygnost- 
HBsAg mtoro) commercially available from Behringwarice. Marburg. W. Germany. The availabHIty of CS is 
sequences In the HBsAg particles to an Immunoreaction was tested In ttte above-mentioned EUSA assay. 
Appropriate dilutions of samples to be tested On PBS containing 0.2% BSA) were aOowed to bind (overnight at 
room temperature) to tiie solid phase anti-HBsAg (Enzygnost kit). The bound HBsAg particles were Incubated 
for one hour at 3r*C wWi an 1/10.000 dilution (In PBS containing O.HVb BSA) of a mixture of five monoclonal 
antibodies directed against the CS protein (Dame et al.. Science, 593-599, 1984) as the first antibody, and SO 
ttien incubated wttti an 1/500 dilution (In PBS containing 0.1(Vb BSA) of blotinylated sheep anti-mouse Ig 
(/^ersham) as the second antibody for one hour at ST'C. Detection was by lncut>ation wttti an 1/1000 dilution 
(in PBS containing O.IQ/b PBA) of streptavklinbk>tinylated horseradish peroxtelase complex (Amersham) and 
chromogen from the Enzygnost kit for 30 mIn. at 37° C. Between Incubations tiie trq^ were washed four times 
wHh wasMng solution firom the Ere^gnost kit Color devek>pment was measured al 510 nm In a Titertek 2S 
MuMskan (Dynatech). 

Measurement of protein concentration and AUSRIA activity (Table 11) in crude cell extracts showed that 
pRrn2573 directed HBsAg expression at the same level as pRrT12329. However, pRm2574 showed about a 
ten-fold lower AUSRIA activity. lmmunobk>t analysis of tiiese extracts showed about an equal amount of 
HBsAg related protein synttiesized In those ttiree cases. SO 

Table II - A USRIA activity la crude cell extracts 



Plasmid 



Gene 



Protein HBsAg (RIA) HBsAg 
(mg/ml) (ng/ml) ng/ng 
pgoteift 



40 



PRIT10172 
PRIT12329 
PRIT12573 
PRIT12574 





5.6 


0 


0 




3.3 


5700 


1730 


PreS2-S 


3.6 


4000 


1110 


PreS2-S 


2.8 


450 


160 



4S 



SO 



The EUSA assay showed that a structure expressing botii HBsAg and CS epitopes Is present In pRnri2573 
and pRrn2574 extracts (Table "0 and tiurt tiiere are more CS epitopes per HBsAg epitopes that react In 
pRrn2574 extracts than In pRrT12573 extracts. 

To examine the nature of these RIA and EUSA positive structures, one ml of the crude cell extracts was 
mixed wltti 1.5 M Csa In 50 mM sodium phosphate (pH 7.4) and centrifuged for 40 hours at 40.000xg In a 
Beckman SW50.1 rotor. Coflected fractions were assayed for tiie presence of HBsAg and CS antibody binding 
activity wtth tiie RIA and EUSA metiiods described above. HBsAg and CS antigenic activity was found to 
coequill brate In both pRrT12573 and pRm2574 gradients. Their buoyant density was found to be sllghtiy 
higher (rho - 1Jao q/cnfi) tiian tiie one for pRnn2329 (itio - 1.18 g/cm^). 

Immunoblot analy^s of the gradient fractions conflnned the RIA/EUSA results. HBsAg and/or CS 
sequences were found only In those fractions that did react In the RIA/EUSA assay. 
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maAa aad CS epitopes present la antigen 



45 



so 



ss 



60 



65 



bound to immobilized anti-H BsAa antibody 



5 


Ccude cell 


Dilution 


ELISA (HBsAg) 


ELISA (CS) 




extract from: 






OD-.. nm 
SIO 


10 


1 nfi44C fDRIT10172) 


lOOx 


0.000 


0.065 


IS 


iOfi44C fpRIT12329) 


lOOx 


0.S80 


0.054 




200X 


0.341 


0.041 




• 


400X 


0.188 


0.056 


20 


10S44C (PRIT12573) 


lOOx 


1.298 


1.596 






200x 


1.080 


1.299 






400X 


0.702 


*0.898 


25 




800x 


0.336 


0.560 






1600X 


0.15S 


0.323 


30 


- 


3200X 


0.075 


0.189 




10S44C {PRIT12574) 


lOOx 


0.662 


1.678 


35 




200X ' 
400X 


0.370 
0.183 


1.506 
1.130 






800X 


0.079 


0.798 


40 




1600X 


0.044 


0.449 




3200X 


0.061 


0.205 



These results Indicate that the hybrid protein produced by «vrMaon of P™T1»p andpRIT^4A)es 
assemble Into particles Uke the 22 nm HBsAg particles upon synthesis In yMsrt. TTw P»'"«'«\PIS?i!flSi 
expression of pRITIZSra and pRrT12574 do expose the CS sequences on their «SS2^!SJ!S£^ 

to reart with antl^ antibodies, in the case of pRm2574.th^ 
aeoesslbinty of the a deteimlnanl of HBsAg (the one mainly responsible for AUSRIA aoUvtty) In the RIA test 

Example 13. 

was dartned by polyethylene glycol (PEG) 400 precipitation and concentrated by ultralOliutlon In an AMKXJN 
DC eaulDped with a cartridge having a cut-off of 100.000 daltons. 

Furttier purification of the hybrid particles was done by methods known In the art. for example. CsCI 
centrifugatlon. hydroxyapatlte chromatography and gel filtiration. «. 

The final purified preparation from ceH extract of 10S44C containing pRIT12574 stowed one "jj" P««J5 *» 
exclusion volume of a HPLC TSK-3000 column (LKB) which contained both HBsAg and^ f^"*^: 
SD&Do|yacrylamkle gel electrophoresis (LaemmO. Nature.^. 680. 1971) toltov>«dby silver 8tetab|gO»^ 
td. Arei. Bkwhem.. 118. 197-203. 1981) showed one major band of estimated molecular weight (MW) 37.000 
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kd. Electron microscopy after negative staining with uranyl acetate showed particies of about the same size 
and shape as yeast derived IHBsAg. 

B. Purtficatfon of hybrid particles 

Crude yeast extracts are prepared by Dyno Mill disruption of the washed yeast ceils In the presence of an 
equal vrolght of 50 mi^ sodium phosphate extraction buffer (pH 8.1 ) supplerhented with 4 mM THrlpiex III, 1 Qte 
Tween 20. 4 mM PMSF and 10 % Isopropanol. 

Cell debris are removed by centrifugatk>n at 1 1 .OOOxg for 90 min. Five hundred ml centrlfuged crude extract 
are adjusted to pH 7^ with IN acetic acid and stlned overnight at 4X In the presence of 10 gr colloidal SWca 
(AerosD 380, Degussa). Subsequently, the suspension Is centrifuged at 3.500 g for 1 hour and the pellet Is 
washed three times with 150 mi of 0.15 M Naa supplemented with 2 mM PMSF and 5 mM EDTA during 15 mIn. 
Rnally the Aerosll pellet Is eluted with 250 ml 10 mM pyrophosphate buffer (pH 9^ containing 2 Wt> TWeen 20 at 

C for 3 hours. To the desorbate is added dropwise a 1 M CaCl2 solution to a final concentration of 30 mM 
CaDa and the pH is adjusted to 7. The solution Is left overnight at 4^*0 and centrlfuged at 6.500g during 15 
mtnutes.The supernatant Is dlalysed agalndt 2 x 51 10 mM TrteAia (pH 7.1) at 4^*0 and subsequently diluted 
four times with dialysis buffer. 

After addition of 30 mM CaDa and 1 M NaQ the diluted antigen solution, adjusted to pH 7.1 . Is passed over a 
150 ml column of Phenyl Sepharose FF, equilibrated In the same Caa2/NaCI containing Trie buffer, at allow 
rate of 30 cm/hr. Subsequentiy the column Is washed wtth equilibration buffer until Obaao nmdecreases to 
zero, and eluted at tite same flow rate wtth 6 M urea In 10 mM Tris/Ha pH9. 

Aftemativeiy, the dlalysed and four times diluted antigen soiuWon Is supplemented wtth 20 % (NH4)2S04 (pH 
7) and passed over a 150 ml column of Phenyl Sepharose FF, equilibrated in 10 mM Trts/HCI (pH 7.1) 20 <Vb 
(NH4)2S04, at a flow rate of 30 cm/hr. After washing ttie column wHh equiUbnettion buffer, the ^mn Is eluted 
at tiie same flow rate wtth 10 mM TrIs/Ha pH 7.1. 

The antigen posttive fractions are pooled and finally centrffuged m one or two successlva Csa den stty 
gradients at 4* C and 245,000 g for 65 hrs. ITie purified hybrid Cs-HBsAg particles have a buoyant denstty of 
^J2S gr/cm». 

Example 14. Immunogenictty of hybrid particies 

Hybrid particles, purified from S. cerev^ae strain 10S44C containing pRm2574, prepared by ttie method of 
Example 13, were Inoculated Into laboratory animals, as such or wtth aluminum hydroxide as adjuvant (Young 
et al.. Science 958-9^ 1985). 

C57B1/6 mice (6-8 weeks), guinea pigs, and rabbtts were immunized sutxnitaneously and tntraperitoneally 
on days 0. 21 and 49. Animals v^ere bled on days 7, 28. 56 and 84. Blood was allowed to clot at 4*C ovemlgtit 
and sera were separated and stored at -70^C until used. 

Rabbtts, 2 groups of 2 animals each, received 10 jig of the hybrid CS-HBsAg protein In 1 .0 ml doses, wtth or 
wtthout aluminum hydroxide, at each immunlzBtion. As a control, 2 rabbtts were Immunized wtth 10 |ig 
Heptavax B* (Merck Sharpe a Dohme) fdlowlng ttie same Inocutation scheme. Mice and guinea pigs, In 
groups of five animals, received 1 }ig arid 5 |is hybrid protein respectively. In 0.5 mi doses at each 
immunization. Nonlmmunlzed animals served as controls. 

For detemnlnation of antibody responses, sera from mice were pooled for each group while guinea pig and 
rabbtt sera were assayed IndMduaDy. Anti-CS antibody titers were measured by EUSA assay liaing the 
recombinant CS protein R32tet^ as antigen (See, Young et al.. Science. 968-^ 1985). Antf-HBaAg 
titers were detennlned by using the AUSAB ktt (Abbott Laboratories) anSTffie WHO reference. 

Netther guinea pigs nor mice developed appreciable antibody responses to HBsAg desptte repeated 
boosting. In constrast, both animal species developed antl-CS antibodies and tfiese responses were 
augmented by boosting. An atxsorisance of 1.0 m the EUSA assay was obtained wtth Serum dilutions of 1600 
fold (mice) and 9000 fold (guinea pigs). Immunogenictty of the hybrid particles was not enhanced by 
adaorption to aluminum hycfaroxide. 

in contrast rabbtts developed antibodlea to both CS and HBaAg. Reciprocal titers at an absorbance of 1 .0 Ih 
the anti-CS EUSA assay were between 800 and 3200. Anti-HBsAg titers obtained wtth the hybrid particles 
ad8orit>ed to aluminum hydroxide were between 4000 and 20.000 mUi/ml while the rabbtta Immunized wtth 
Heptavax gave titers of 10* to 10® mUi/ml. Adsorption of the hybrid particies to aluminum liydroxide enhanced 
the immune rssponse. 

Sera from mice, guinea pigs and rabbtts all reacted strongly In the circumsporozotte precipitin reaction as 
well as Inblbtted sporozotte Invasion of hepatoma cells In vttro (Young et al.. Science, ^6^62, 1985). (See 
Table IV). Botii reactions are Indicative of a protective Immune response. 
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Table IV. Clrcumsporozoite precipitin (CSPl reacti- 
vity, and percent inhibition of sporozoite 
invasion f% ISI) of HepG2-A 16 hepatoma cells 



AnlBal(s) 

1 week DOSt 2nd boost 


CSP Reactivity 

(A* 2* 0) 


% ISI 


Pooled mouse sera 


14 


9 2 


91% 


Rabbit #36 


22 


3 0 


96% 


Guinea pig #1107 


24 


1 0 


100% 



Degree of CSP reactivity is shown in parentheses: 
0 - No CSP reactivity detectable 
24- - granular precipitate on surface of spbrozoite 
44- - threadlike filament from end of sporozoite 



Example 15. Vaccine Preparation 

30 An Illustrative vaccine of the subject Invention is prepared as IbBows: To a buffered, aqueous solution of 3Q/b 
aluminum hydroxide (10 mM sodium phosphate, 150 mM NaCI. pH 6.8; sterilized tjy filtration), the particle of 
the Invention In similar buffer is added with stirring to a final concentration of 100 jig/ml of polypeptide and 0.5 
mg/ml of aluminum (AP+). The pH Is maintained at 6.8. The mixture Is left overnight at about 0" C. Thlmersol Is 
added to a final concentration of 0.005<^. The pH Is checked and adjusted. If necessary, to 6.8. 

SS While the above fully describes the invention and all preferred embodiments thereof. It is to be appreciated 
that the invention Is not Dmited to the embodiments particularly described but rather includes all modifications 
thereof coming within the scope of the folioMring ciaiins. 

EXAI^PI,ES OF Pre 52^ PROTEIN AND PreSI-Pre S2-^ PROTEIN PRODUCTION BY YEAST 
40 It has been discovered that the insertion of the Pre S2-^ protein coding region of this Invention in a yeast 9 
expndssion vector into a yeast results In the synthesis of particles resembling the authentic 22 nm HBsAg 
particles exposing glycosylated Pre S2 protein on their surface. It has not been previously conectiy reported 
that yeast, transfonmed witii any portion of the HBV genome, express a glycosylated product 
The following examples retete to the expression In yeast of the glycosylated form of a protein containing the 
45 Pre S2 region in addition to the S jmtein of l^epatitis B surfece antigen. The protein can be extracted and 
purified from yeast cells In the form of particles for use as a human vaccine. The Pre S1-l^ S2-S protein 
expressed in yeast cells Is also glycosylated and may also t>e extracted and purified therefrom in the form of 
particles for use in a vaccine against Hepatitis B virus. 
Witii respect to the finding of glycosyteition. the following should be noted: 
SO 1. Examples 25, 26 and 27 concern concanavalln A binding, tunlcamycin effect and sensitivity to Endo 

H. respectively. Each prove that tiie pre S2-8 protein species of 33Kd is glycosylated. Thus, glycosylation 
has been proven t>y three Independent methods. 

2. The specificity of these reagents are. in general terms: 

a) concanavalln A binds preferentially to mannose residues. 
SS b) tunlcamydn inhibits ail types of glycosylation that are N-glycosldlcally bound to aspara^e. 

c) endoglycosldase H cleaves asparagtne-llnked Ngh mannose ollgosacchardles leaving one 
N-acetylglucosamlne residue attached to the asparagine. 

3. From the combined results of Examples 25, 26, 27 and the above, it is deduced that there is a 
N-linked oligosaccharide chain present on the 33Kd form of the pre S2-45 protein (tunlcamycin Inhibition, 

SO sensitivity to Endo H) which Is of tiw high-niannose type (sensitivity to Endo H). The shift in molecular 

v^ight observed (alK>ut 3000) upon tunlcamydn treatment and Endo H digestion, Indicates tfiat ttie 
oligosaccharide structure is composed of alsout 10 sugar residues. This is about the size of a core 
glycosylation In yeast Thus, probably only one of the three potential glycosylation sites In the Pre ^-^S 
protein Is glyco^rlated. 9 

SS 4. The purification schemes descrfbeD In Examples 28 and 29 both contain an affinity chromatography 
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step based on sugar residues of a glycoprotein. 

5. PhenylborBnatehasaspedficityforl^ls-dlol groups (2 adjacent OH groups sterlcallyposWoned In 
the same dfrectlon). 

6. Lentil Sepharose Is ghjcose/mannose specific. 

Example 16. Construction of pRm2662 - an E. coll vector containing the PreS2-S ewresslon cassette (Roure 
16). 

Plasmid pRm2290. obtained as described in Example 3. above, was modified by the insertion of a 
BarnHHEcoRI synthetic adaptor fragment encoding the N-terminai amino acids of the pre S2 region between 
the TDH3 promoter and ARQ3 tennlnator fragments. This was done as follows: 

Plasmid pRrn2290 was d^ested with BamHI and EcoRI enzymes and dephosphoryiated with alkaHne phos 
phatase. 200 pmoles of the phosphorylated synthetic adaptor. 

Gin Trp 

Bami 5' GATC GAG TGG 3* BcoRI 

3' GTC ACCTTAA 5* 



lie 
Tyr Stop 
5 • ATACATTTAACG 3 ' 

he&l EcoR I 
3 • TGTAAATTGCTTAA 5 • 



10 



IS 



were llgated with 0.1 pmoles of the BamHI-EcoRI dephosphoryiated plasmid pRIT12290 using 1 unit (U) of T4 20 
ngase and used to translbrm the E. coH strain MM294 to ampicillin resistance. One transformant harboring the 
desired plasmid with the synthetic adaptor inserted between the BamHI and EcoRI sites was Identified and 
retained. The plasmid Is Identified as pRrn2621. Starting with plasmid pRm2621. the next steps wore the 
elimination of the four extra base pairs (the core of the BamHI site) in order to put the ATQ codon in phase wHh 
the second codonolthe Pre S2 region, and the insertion cf an EcoRI fragment contalnlr^ 2S 
coding region and the complete S coding region Into the EcoRI site of pRm2621 to oomplete the entire pro 
S2-S coding region. 

At this stage the DhJA manipulations Including subsequent Onearlzation, deletion and redrcularization of the 
plasmid pRrT12621, were realized without ampMcation by translbnnation of intermediate plasmids: the 
procedure was as follows: ^ 

30 pmoles (120 |jig) of pRrT12621 DNA were linearized with 120 U of BamHI. After eKmlnaUon of the 
restriction enzyme by phenol extraction, the plasmid was ethanol precipitated and resuspended in the 
appropriate buffer to digest the BamHI single strand extensions with 280 units (U) of mung bean nuclease. ITie 
nuclease was eliminated by phenol extraction followed by ethand precipitation. The so treated plasmid was 
resuspended in the ligation buffdr and redrcularized with 10 units of T4 ligase. ^ 

The preparation containing the Rgated plasmid was phenolized, ethanol precipitated and resuspended Into 
appropriate buffer and digested with EcoR I enzyme. After elimination of EcoRI endonudease by phenoilzBtion 
and ethanol precipitation, 0.05 pmoles (0.2 (ig) of DNA were resuspended In the Rgatlon buffer In order to be 
llgated with 0.4 pmoles (0.2 pg) of a 838 base pair EcoRI fragment Isolated from pRm2581 that contains the 
entire 0-temi!nal coding region of the pre S2-S protein. A plasmid essentially similar to pRFTI^I can be 4» 
constructed as follows. The vector pUC9 (available commerdaily from Amersham and Pharmacia) is digested 
with EcoRI endonuclease and treated with allcailne phosphatase (Shine. U.S. patent 4,264,731). Plasmid 
pRiT10616 (ATCC 39131), is digested with EcoRI and AccI restriction endonucleases and the 826 EcoRMccI 
fragment containing part of the pre S2-8 coding region Is recovered by preparative acryimQe*^ 
electrophoresis and electroelution. About 0.4 pmoles (0 j2 |ig) of this fragment Is mixed with about 10 pmolea 45 
of the fbllowfrig 12 bp synthetic adaptor previously annealed as 
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described in Example 8, above, and the mbcture Is treated with T4 DNA ligase and digested with EcoR I 
restriction endonuclease. To the so traated mlxhjre Is added about 0 |ig of the EcoR I digested allidtne 
phosphatase treated pUC9 vector, prepared as described above, and the mixture is treated with T4 DI^ ligase » 
and used to transform competent cells of E. ooll K12 to amplclUln resistance by conventional procedures. 
Colonies from the transfonnant are examined for the presence of a plasmid consisting of the 2650 bp pUC9 
vector together with a 838 bp EcoR I fragment comprised of the 826 bp EcoR I -AccI pre S2-S fragment together 
witii the 12 bp Acci -EcoR I synthetic linker. The correctness of the constmction so Identtflsd can be verified by 
DNA sequencing using the chemical modification method of Maxam and GHbert (Methods In Enzymology, 65, 65 
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499, 1980). preferably from a unique flanking restriction site in ttie pUC9 polyflnker 

The ligation mixture containing plasmld DMA of pRn'12621, treated as desolbed at>ove, and the 838 bp 
EcoR i fragment of pRrn2581 was used to transform the E> coll strain MM294 according to the method of 
Cohen et ai., cited above. One transfomiant having a plasmld with the EcoRi fragment in correct orientation 
was retained and the plasmld Is identified as pRIT12662 (See, Figure 1A and Figure 16). Rgure 1A is a flow 
sheet Illustrating the preparation of pRm2662. Figure 16 is a restriction endonudease map of pRrri2662. The 
pre S2 region from pRn'12662 was sequenced by the chemical modification procedure of Maxam and Gilbert 
(Methods In En2ymoiogy, 489, 1980) to chedc that the coding reading frame at the N-termlnus was as 
follows: 



Met Gin Trp Asn Ser 
AAAC AAAC AAA ATG CAG TGG AAT TCC 
pTDH3 pre S2-S 

coding region 



it will be appreciated by one skilled in the art that the vector pRrT12662 may also be used for the Introduction 
of further functional DNA sequences into the preS2 region by appropriate In vltro manipulation of vector DNA. 
Hie preS2 coding sequence contains restriction sites for EcoRi. ^tl and BamHi endonucieases. Any of these 
sites msy be used for Introduction of functlondi DNA sequences encoding peptides of interest to create In 
phase fusions with the preS2 region. Such fusions will t>e of the type preS2 - introduced sequence - preS2«^ 
These restriction sites may also b& manlputated in vitro to create other vectors and provide other sites or 
reading frames for insertion of functiona] DNA sequences, such as by the metiiod of Botstein et ai. (Science, 
229. 1193-iaDI. 1985). 

Example 17, Construction of pRm2377 and of pRm2660 ayeast plasmld expressing the prBS2-^ protein 

DNA of plasmld pRm2309, obtained as described in Example 10(B). was digested with Sail endonuclease 
and Dgated with a 221 8 bp Xhoi-Saii fragment of DNA containing the yeast LEU2 gene obtained by digestion of 
YEp13 and purification of the fragment t>y acrylamide gel electrophorssls. The ligated mixture was used to 
transform cells of E. Ki2 strain JA221, prepared according to the procedure of Cohen et al.. cited above, 
with selection t>eing made for both ampicinin resistance and leucine independence. Strain JA221 was obtained 
from M. Rosenberg, Smith Kline and French Laboratories, Philadelphia, Pennsylvania, U.SA, and Is available 
from the American Type Culture Collection under accession number 33875. A plasmld, pRIT12377, was 
identified from one transformant colony which contains the 2218 bp I.EII2 )Qiol-Sa[l fragment inserted at the 
S^i site of pRin2309. 

The orientation of the fragment on the vector is such as to recreate the Sa[i site on the Aval side of the 
pBR327 part of the vector. Plasmid pRIT12377 Is an E. coll -yeast shuttle vector havlnglRe necessary 
functions for selection, replication and maintenance in both orgar)isms. 

The 3050 base pair Hindl ll fragment from pRIT12662 (Example 16), containing the pre S2-S expression 
cassette was purified t)y acrylamide gel electrophoreds, treated with T4 polymerase and ligated to pRrn2377 
prsviously opened t)y BamHi enzyme and T4 polymerase repaired This preparation was used to transform 
strain E. toD MM294 to ampiciliin resistance according to the method of Cohen et al., dted above. An E. coil 
transformant harboring the plasmid pRm2660 was retained. Figure 17 is a flow sheet IDustratlng preparation 
of pRrn2660. 

This plasmld, pRrn2660, was used to transform two strains of S. cerevlslae, Le., strain DCS dr^ and stra^ 
10S44C dr^, according to the method described by Hinnen et ai. (Proa Natl. Acad. Sd.. USA, 79, 2157-2161. 
1978). ~ 

Both strain DCS dr^ and 10S44C clr^ were deposited in the American Type Culture Collection, Rocicvllle, 
Maryland. U.S A, in accordance with the Budapest Treaty on August 18, 1988 under accession numt)ere ATCC 
20820 and ATCC 20818, respectively. 

Example 18 . TTie preS2 region in plasmld pRn'12662. 

The preS2 region in plasmld pRm2662 was sequenced using the chemical modification method of Maxam 
and Gilbert, Proc. Natl. Acad. Sd., 74, S60-G64, 1977) and It was found that there were four bas& pair changes 
leading to three amino add changes when conr^>ared to another virus of adw2 serotype (Valenzuela et al.. 1980 
-Table I page 11). An alanine residue Instead of athreodne residue at position -48, a phenylalanine Instead of a 
teudne at position -34. and a serine reddue Instead of an Isdeudne reddue at |X)sltion -11. The last change 
affects a reddue that had been found Invariable in aH Icnown serotypes until now (Lo et aL, Blochem. Blophys. 
Res. Com., 2, 382-388, 1986). 
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Example 19. Synthesis of paiUdes canying a receptor lor polymerized human serum albumin (pHSA) 

Yeast (S. cerevislae) strains DCS dr^ or 10S44C clr" containing either pRm2660 or pRrri2363 (Example 
16) were grown In selective medium [200 ml YNB + 80 jig/ml histidlne (DCS cir*) or YNB (10S44C dr*)] Into 
log phase. (pRm2363 Is Identlcai to pRm2660 except for the hepatitis derived gene; pRm2363 contains the 
S gene only). Cells were collected by cdntrtfugatfon and resuspended In SO mM sodium phosphate pH 8.0 5 
containing 0^ Tween 20 (polyethylene sorbltan monolaurate), 1 mM PMgp (phenybnethylsulfbnyl fluoride) 
and 2mi isopropanoL Cella were disrupted by passaging twice through a French Press at 20.000 pst (1.38 x 
108 Pa). The suspension was centriftjged for 30 mln. at 30.000 X g. TolaJ protein concentration In the 
supernatant liquid (crude cell extract) was measured by the method of l-owry et al., (J. Bid. Chem.. 193 , 
26&-275, 1951 ), wrtth bovine serum albumin as standard. The presence of HBsAg related material was assayed io 
using a radidmmune assay kit (AUSRIA II) commercially available from Abbott Laboratories. The results (Table 
V) Indicate that pRrri2660 directs the synthesis of material antlgenlcally related to HBsAg particles. The 
expression level, based on AUSRIA activTly. Is about the same In yeast strains DCS dr* and 10S44C clr*. 

In a radtoimmune assay to detect a receptor for polymerized human serum albumin (pHSA) (Pontisso et al., 
J. Virol Methods , 6. 1S1-1S9, 1983), crude cell extracts from pRIT12660 containing cells gave a positive is 
reaction whDe crude cell extracts from pRff 12363 containing cells did not No reaction above badcground was 
observed with polymerized bovine senim albumin. 

CsCi equilibrium centrtfugation of crude ceO extracts firom pRrT12660 containing cells showed the AUSRIA 
positive material to band around rtio - IJZO gfcnfl as do serum derived or yeast derived HBsAg irarticles. 
Material recovered from this portion of the gradient gives a positive result In the pHSA test. 2D 

The aforementioned results show that In pRm2660 containing yeast cells A HBsAg related protein le 
expressed which is assembled Into a stmcttire like that of 22 nm partides from serum and exposing a receptor 
for pHSA on Its surface. 



Table VA. AUSRIA activity in crude cell extracta 



Strain Plasmid Protein HBsAg related HBsAg related 

mg/ml antigenicity antigenicity 
ng/rol ng/mg protein 

DCS Cir« PRIT12660 13.4 7 522 

DCS Cir* PRIT12660 12.6 7 555 

10S44C cir^" PRIT12660 10.8 6 556 

10S44C cir<* PRIT12660 9.5 6 631 
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Example 20. Four S-related protein species are found to t>e expressed In yeast transfonned with pRm2680. 

To analyze the HBsAg related protein monomers expressed in pRIT12660 containing ceHs. ceils were grown GO 
In selective medium, either In Erienmeyer fiaslcs or In fermenters. Into log plwse and collected fay 
centrtfugation. 

To the paciced ceils (0.1 gram), first 20 |il 40 mM PMSF in isopropanol, and then 200 |ii sample buffer for 
SDS-polyacrytamide gel electrophoresis (0.125 M tris-Ha. pH 6.8 containing 20iVb glycerol. Afih SDS. 6 M urea 
and 10IM) 2-mercapto-ethand). were added. Samples were vortexed and allquots of 2 to 20 |iJ were further 55 
diluted in sample buffer to a final volume of 40 (il. Incubated for 5 minutes at 100^ and electrophoresed through 
a 125% separating, Wo stacking gel according to the method of LaemmD (Nature, 680, 1871). 

After electroblottlng (T owbin et al., Proc. Natl. Acad. Sd., U.S A, 76, 3504354. 1979) of the proteins onto a 
nitrocellulose filter (0.45 |im, BloRad), the filter was prelncubated for one hour at ZTO with 3<Vb gelatin In PBS. 
The filter was IncutMited with a monodonal antibody directed against a denaturatlon and reduction resistant 60 
epitope of human derived HBsAg (monodonal 6 obtained from H. Thomas, Royad Free Hospital, Ljondon. 
England) and antibody binding was detected by successive Incubation wKh blotinylated sheep anti-^ouse Ig 
(RPN 1021, Amersham, dilution 1/2S0 in PBS containing 1<Vb gelatin), steptavidlivblottnyiated horseradish 
peroxidase complex (RPN 1051. Amersham. dilution 1/400 In PBS containing m gelatin) andflnaHy 30 |il HaOs 
and HRP Cdor Development reagent containing 4-chloro-1-naphtol (BloRad), 30 mg In 10 mi methand. In SO 55 
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ml res. Between incubations the filter was washed with PBS containing OA^ Tween 20. 

The immunoblot showed 4 protein bands related to the S protein (p23) of estimated molecular weights of 
about 33Kd. 30Kd, 28Kd and 25Kd wMch are detected by the monoclonal antibody. The majority of 8 related « 
material is found In the 33K protein band and was found to migrate sflghtly faster than the gp33 protein 
5 described by Stibbe and Qerilch (Virology, 1^, 436-442, 1982). The smallest protein band detected migrates 
slower than the S protein (p2S) synthedzed in yeast 

Example 21. The four S^laied protein species (of 33Kd, 30Kd, 28Kd and 25Kd) are assembled in particles. 
Crude cell extracts of pRm2660 (Example 16) containing yeast cells were prepared as described in 
10 Exampled 19 or by grinding yeast cells in a Dynomlll In the presence of an equal weight of extraction buffer (200 

mM trie. 3 M NaQ, 10 mM EDTA. 10 mM ethylene glycol bis (beta-amlnoettiyi ether) N,N,N',NMetraacetlc acid 

(EGTA), 4 mM PMSF. 10<Mi isopropanol) and subsequent centrifugation for 45 minutes at 30,000 X g. 
The pRrn2660 directed particles were partially purified from the crude cell extract by uKrafiltrafion, CsCI 

equilibrium centrifugation and hydroxyapatite chromatography which are methods t<nown in the art The four 
IS protein bands of S-reiated material detected by Westem blot analysis copurity during these purification steps. 

An Immuno-biot analysis (as descrik>ed in Example 20) of the AUSRIA positive CsCI fractions after equilibrium 

centrifugation showed that the fbur SHfelated bands were present in the same relative proportions in each 

fraction. 

The material is thus distributed In homogenousfashion throughout the pealc fractione of the gradient 
20 f 
Example 22. The 33Kd and 30Kd proteins cany a receptor for polymerised human serum albumin. 

Human serum albumin (Sigma, A 6763) was polymerised by glutaraldehyde treatment (Merdc, A-12179) and 
purified according to Pontisso et al (J. Virol, methods, 6 151-159, 1973). Reactivity of the four S-related 
proteins towards pHSA was tested in a Westem blot procedure. Pi^tiaily purified preparations of pRIT12660 ^ 
25 directed parHdes (Bcample 21) were separated by SDS polyacrylamlde gel electrophoresis and transferred to 
a nitrocellulose memtxrane as descrft>ed In Example 20. The nitroceiluiose filter was successively lncut)ated 
with pl^SA (15 )ig/ml in PBS containing I^Vb gelatine), rabbit anti-serum to human albumin (Behringwerice, 
ORCB 04/05, 1/125 dilution in PBS containing m gelatin), blotlnyiated donkey anti-rabbtt Ig (Amersham RPN 
1004, 1/250 dilution in PBS containing I^M) gelatin), streptavidin-blotinyiated horseradish peroxidase 
30 (Amersham RPN 1051 1/400 dilution in PBS containing IQfe gelatin) and finally with HzOz and 
4-chloro-lHiaphtoi as descrit>ed In Example 20. 

The two laigest S-related protein bands (the 33Kd and 30Kd) reacted with pHSA, the two smallest ones 
(28Kd and 25Kd) gave no reaction. 

35 Example 23. The 33Kd and 30Kd proteins contain a sequence specific for the preS2 region of the prBS2-S 
protein 

A peptide corresponding in 23 of 26 positions to the 28 N-terminal amino acids of the preS2-S protein was 
synthesized (NH2-Met-QIu-Trp-Asr>-^r-Thr-Ala4>he4fls-Qln-Ala4-eu-Qln^Asp4^ 
|jeu4>ro-AI&-Gly-Cys-COOH) and coupled via the C-termlnal to Iceyhole Limpet hemocyanin (Perdnsula 

40 Laboratories). Rabbits were immunized suticutaneously with 100 }ig of the conjugate (which corresponds to t 
20 [ig peptide) In Freund's complete adjuvant, boosted at days 8 and 15 with another 100 )xg conjugate in 
Freund's complete adjuvant and at days 28, 69 and 149 with 1 00 ^g conjugate in PBS. Development of antibody 
titer was followed by taking blood samples at regular intervals and testing serum dilutions by incubation on 
solid-phase fixed synthetic peptide. Bound antibodies were detected by iodlnated protein A (New England ^ 

45 Nuclear). For immunoblot applications, sera with antipeptide titers greater or equal to 1/5120 were selected. 
The serum was used as such after a 1 00 fold dilution or after partial purification of IgG. IgQ was partially purfiied 
by precipitation with Na2S04 (120 mg per ml of serum), resuspenslon In PBS and repreclpiteition with 14iMi 
Na2S04. The resuspended IgQ solution was desalted by passage on a PD10 column (Pharmacia). 
Detection of reactivity of these antilxxiies towards the S-related proteins in Immunoblots was by 

50 biotinytated donlcey anti-rabbit Ig, streptavldlrv-blotinylated horseradish peroxidase and H2O2 and 4-chloro- 
1-naphtol as described In Example 22. 

Proteins from partially purified preparations of pRrn2660 directed particles (Example 21) were separated by 
SDS polyacrylamide gel electrophoresis and transferred to nitrocellulose as described in Example 20. Also run 
on the gel was S protein (p23) from particles purified from yeast harboring pRrn2363. The immunoblot 

56 showed that the two largest S-related protein bands (the 33Kd and 30Kd) react with the anti-peptide 
antibodies while the two smallest ones (28Kd and ^Kd) give no reaction. The S protein (p23) gives no reaction 
with the anti-peptide antibodies. 

Example 24 Human anti-l4epatitis B antibodies from naturally Infected persons recognize epltope(s) on the 

60 33Kd and 30Kd preS2-^ proteins not present on the S protein. 

Reactivity of pooled gamma globulin from anti-hepatitis B positive human subjects (Belgian Red Cross, 
100-150 lU/ml Lot 85A08) towards the four S-related proteins was tested by the Immunoblot procedure 
described In Example 20. A partially purified preparation (Example 21) of pRm2660 directed particles was 
dissociated by SDS and the proteins separated lay SDS polyacrylamide gel electrophoresis. After transfer of » 

65 the proteins to nitroceiluiose, the membrane vras incubated wfth a ten^ld dilution of anti-Hepatitis B gamm 
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globulin In PBS containing m gelatine.. Antibody binding was detected by successive Incubation wfth 
blotinylated sheep antl-human Ig (Amersham RPN1003, 1/250 dilution In PBS containing 1<Vb gelatin) 
steptavidln-blotlnylated horseradish peroxidase and finally with H202 ml 4-chloro-1-naphtoI as described In 
Example 20. 

The two largest S-related protein bands (the 33Kd and 30Kd band) react with the human antibodies the two 
smallest ones (the 28Kd and the ^Kd band) give no reaction. The S protein {p23) derived from partides from 
yeast cells containing pRrT12363 does not react with these human antibodies. These results show that gamma 
globunn obtained from infected persons specifically recognizes epitopes present on denatured and reduced 
preS2-S proteins which are absent on denatured and reduced S protein. It Is loiown that the epitopes present 
on S protein derived from human semm and yeast are predominantly conformational. 

Example 10A. The 33Kd protein species canles a mannosyl containing oUgosaccharide structure 

PreS2^ protein from partially purffled preparations of particles from pRrT12660 containing 10S44C cir** cells 
was electrophoretlcally separated and tnansfen^ to nltroceOuIose as described In Example 20 Also run on 
the gel were S protein from particles purified from yeast containing pRrn2363 and HBsAg particles from is 
Infected human semm obtained from Dr. W.H. Qerilch. Department of l^ioal Microbiology. Unlvereitv of 
G6ttingen, Federal RepubOc of Germany. 

One half of the nitrocellulose filter was Incubated In 60 jig/ml concanavalin A (Calblochem A grade) In 10 mfA 
tris pH 7A 150 mM NaO 1 mM CaQa 1 mM MnCis OXm> Tween 20 and subsequently with 30 uo/ml 
horseradish peroxidase (Sigma type VI P 8375) in 10 mM trie pH 7.5 150 mM itoCi 0.05<Vb TVwen 20 and l^lv 20 
H2O2 and 4-chIoro-1-naphtoi as described In Example 20. 

Between Incubations the filter was washed wHh 10 mM tris pH 7.5 150 mM Naa 0.06<Vb TWeep 20 (Hawices 
Anal BIpchem. m 143-146. 1982: Clegg. Anal. Bk)chem..127. 38^. 1982). The other half of the 
nitrooellulose filter was incubated with monoclonal antibody 6 and treated as dascrft>ed In Example 20. 

The 33K protein band reacts with the concanavaDn A-peroxIdase reagents whDe none of the 30Kd, 28Kd 2S 
25Kd protein bands or the S protein derived from pRrn23e3 containing ceU reacts. In the presence of 0 1M 
methyl aipharD-mannopyranoside the binding of ooncanavaOn A to the 33K protein Is abolished. 

Example 26. An ollgosaa^arlde structure Is N-glycosldically bound to asparegine 

pRm2660 and pRrT12377 (Example 17) containing ceils were grown In selective medium (YNB + 80 ua/mi SO 
histidlne (DCS clr^) or YNB (10S44C On] to ODeaonm- 0.2, Tunicamycin (Calblochem) was added to aiinai 
concentration of 10 jig/ml and the cuttures incubated for 30 min at 30*C. Then 20 pCi »«S-methlonIne (1000 
Cl/mmoi) (New England Nuclear) per ml was added and the cultures were Incubated another 15 minutes. 2 ml 
of labeled cell culture, either treated or untreated with tunlcamydn, were centrtfuged In a microfuge and the 
paciced cells were resuspended in40 ^ 1(Vb SDS and disrupted by agitation with a Vortex mbcer for 2 min. In the 3S 
presence of 0.3 gram glass beads (diameter -0.45- 0.50 mm). 

Immune precipitation was essentially according to Julius et al.,(Cel!, 36, 30M18, 1984). The iysate was 
heated for 3 minutes (min.) at 100*». diluted with 0.5 ml PBS containing m Triton X100, 0^ sodium 
deoxychdate O.m SDS and heated again for 1 min. at 100*. 

25 |il of a 10<Vb suspension of prewashed Immunoi>reclpmn (Fonmalin-flxed Staph A cells, Bethesda 40 
Laboratories) was added to the boDed extract and Incubated at 0^ for 30 min. TTie mWure was 
clarified by centrifugatlon for 5 min. In a microfuge. To the supernatant Uquld, 2.6 |il monoclonal antibody 6 
specific for S protein (see Example 20) %vas added, and the mbclure was Incubated for 1 hour at 0*. Another 
portion (25 111) of Immuno-predpltln was then added and Incubated for 30 mIn. at O*. The Immune complexes 
were collected by centrifugatlon (5 min. in a micro fuge) and washed with PBS containing 2M urea. PBS 46 
containing 1«Vb 2-meroaptoethanol and fftially PBS. The final washed pellets were resuspended in sample 
buffer for electrophoresis as In Example 20. 

Samples containing 10* cpm were electrophoresed through a 12.5% SDS polyacrylamlde gel (see Example 
20). After elecfrophoresis. gels were fixed In AWt methanol 10<Vb acetic add, treated wfth Amplify (Amersham) 
for fluorography. dried on filter paper under vacuum and subjected to fluorography using Kodak X-Cmat S film 50 
at -70**. 

This showed that pRm2660 containing cells synthesize a 33Kd protein In the absence of tunicamycin and a 
^ *® presence of tunlcanriycia Both bands are absent In the contending samples of 
pRIT12377 containing cells. —r- » r 

This results shows that the 33Kd preS2-S protein expressed In yeast strains DCS di* carries an ^ 
oligosaccharide portion N-glycosydlcally bound to asparaglne. There Is most probably only one 
oHgosaccharide chain attached to the 33Kd form of the preS2^ protein which is not much lamer than a core 
gtycosylatlon. 

E^P'o 2^- The oDgosaccharlde structure is of the hlgh-mannose type. sq 

Crude cell extracts from pRm2660 containing cells or partlaliy purified preparations of preS2-S containing 
particles from these extracts (Example 21) were treated wfth Endo-N-acetylglucosaminidase (Endo H.; 
EC.3.2.1.g6) from Streptomycespilcatus obtained from New England Nuclear. 

A P"^®^ particles (50 jU, 450 jig/ml protein) were boiled for 3 minutes at 100** In the presence of 

0.m sodium dodecyl sulphate, diluted ten-fold wHh 100 mM sodlunhcttrate pH5 and to 240 |il of this mixture 66 
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2.4 yd Endo H (74 ^g/ml) was added. Samples with and without added Endo H were incubated for 4 hrs at 37^. 
Analysis of the Endo H treated and untreated samples tsy SDS-polyacrylamide get electrophoresis and 
immunot}lotting with detection t^ the monoclonal antibody 6 directed against the S protein as described In 
Example 20, showed the disappearance of the 33Kd t>and and the appearance of a 31 Kd t)and upon Endo H 
5 treatment The 31Kd band stOI reacts with the concanavaiin A-peroxidase reagents as described at>ovB. TTie 
position of the other bands (SOKd, 28Kd and 25Kd) did not shift upon Endo H treatment ijonger Incubation 
times or higher Endo H concentrations gave identical results. The result shows tfiat the oligosaccharide 
structure linlced to the 33Kd protein is of the Hgh-mannose type and has atx)ut the size of a core glycosylation. 

10 Example. 28 Purification of prsS2-^ protein containing particles. 

Crude yeast extract containing preS2-S particles is prepared tyy grinding the washed yeast ceils in a 
DynomiO In the presence of an equal weight of extraction buffer, either 50 mM sodium phosphate pH 8.1, 4 mM 
EDrA1iWb1\veen20.4mMPMSFand10(Miisopropandor200mii4^^ lOmMEDTA, 
10 mM EOTA (etfiylene glycol bis (b6ta)amlnoethyl ether N.N.N'.N'-tetre acetic acid). 1<Vb Dufeen 2D. 4 mM 
IS PMSF and lOQ/b IsopropanoL 

The homogenate Is adjusted to pH 8.0 or 9.0 respectively and subsequentiy centrlfuged for 45 minutes at 
30.000 X g. 

To 500 ml of the crude yeast extract, adjusted to pH 7.2. Is added 1 0 gr colloidal Silica (Aerosil 380 Degussa). 
The colloidal suspension is stirred over night at 4" and subsequently centrlfuged at low speed. The AerosQ 

20 peDet Is washed three times with 150 ml 0.15 M NaCI for 1/4 hour and then eluted batchwise witii 250 ml 10 mM 
pyrophosphate buffer (pH 9.3) containing Tween-20 at 37** for 3 hours. The desorbate. a yellowish. Dghtty 
opalescent solution, is passed over a 50 mi column of phenyltK>ronate Agarose (Amicon) equilibrated in 10 mM 
pyrophosphate buffer pH 9.0. The flow rate is 15 mi/hour. 
The column is further washed with 2 column volumes of equlDbration buffer, and is then eluted in tfie 

25 revereed direction with 100 mM sorbitol in 50 mM Tris (pH 7.2) at a flow rote of 16 ml/hour Eluted pealc 
fractions are pooled and after adjustment to pH 8.1 , passed over a DEAE-Fractogel TSK column equilibrated in 
50 mM Tris pH 8.1. The particle eluted in the same t>uffer containing 75 mM NaCL After this step residual 
nucleic acids were removed (<1 )ig/20 |ig protebi). The t>alance stieet is shown in Table VI. 
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Tablft V. TWO step ptirif Icatlon of priBS2-S containing 
particlea 



Fraction 



mg s-related 
protein mg 
ml (RIAl protein 



mg 

^9 poiy- 
lipids sacchartdflg 



10 



Crude 
extract 



500 



11.3 



14,600 



5,286 



7.850 



IS 



Desorbate 
aerosil 



270 



5.96 



385 



554 



249 



20 



Flow 

through 

PBA-column 



315 



2.83* 



365 



360 



267 



Eluate 
PBA-column 



70 



2.8 



2.9 



2.5 



30 



*not sufficiently glycosylated to be retained on the 
column. 



Material purified from extracts of ceQs containing pRrT12660 as described In this Example 28, followed by a 40 
CaCI equilibrium centrifugatlon, was prepared for electron microscope examination after staining with uiwiyl 
acetate. Examination showed the presence of discrete particle structures about 19 microns In diameter, and 
that such hybrid particles contained between 5 to 20 |ig IVveen 20 per 100 iig protein. 

Example 29. Affinity chromatography on lentil sepharose. 45 

In this example, the phenylboronate agarose chromatography step Is replaced by affinity chromatoarBphv 
on lentil sepharose (Pharmacia). ^ -t« j 

50 ml of Aerosll^esorbate, obtained as described In Example 28, was passed over a 5 ml column of lentil 
Agarose equHlbrated in equilibration buffer such as Tris buffer 10 ml^ (pH 7.2). 0.14 M Naa and 0.3% T\¥een. 
The column was washed with 2 column volumes of equIDbratlon buffer such as Tris Buffer 10 mIM (pH 7J2) and so 
the eluted with sugar additive with specificity for lentil such as OJ M aipha^ethylmannopyranodlda In 
equilibration buffer. The balance sheet is shown in Table VI). 
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Table VII > Application on lentil Sepharose affinity 
chromatography in the purif icatioa on 
preS2-S containing particles . 



Volume )i9 S-r elated Total 

Fraction <ml) protein (RIA) Protein 

Crude 100 2170 3 gr 

Desorbate aerosil 50 1700 138 mg 

Flow through 50 428 
Lentll-sepharose 

Eluate 12-5 925 1.26 mg 
Lentil-sepharose 



Examination revealed that the hybrid particles purified as described herein contained between 5 to 50 jig 
Tween 20 per 100 )ig protein. 



Example 30. Immunogenicity of the preS2-S particle. 

The immunogenicfty of particles prepared from extracts of cells containing pRin2660 (Example 16) was 
studied In 2 strains of mice: Balb/c (H2-d) and NMRl (H2-Q). The particles were absorbed on AI(0H)3 before 
injection and the mice (5 mice/group) were injected Intraperitoneaily (IP.) with 0.3 ^ of RIA reacting material. 
^ One month after the immunization, the mice were bled and the sera from the mice of the same group were 
pooled. The titer of anti-8 antibodies was detemnlned In the pools by using the AUSAB kit (Abbott Labs) and 
the WHO reference. The titera were converted in mlU/ml by using the HoiOnger fbnnula (Holllnger et al.. In 
&muness et sd., eds. Viral Hepatitis. Philadelphia : FrankOn Institute Press. 451-466. 19^« The anti-preS2 
antibodies were determined using a solid i^ase ass^ (RIA) in which the synthetic preS2 peptide given In 
^ Example 23. Is ^sorbed on plastic and bound antibodies were detected %vlth a goat antkmouse IgG labelled 
wtth i25|. results summarized In Table Vlil show that the particles from pRrT12e60 containing cells Induce 
both anti-S and anti-preS2 peptide antibodies in both strains of mice with Ngher titere of anti-prBS2 peptide 
antibodies Induced in NMRl than in Balb/c mice. The anti-S titer obtained In Balb/c or NMRl mice Is 2 to 3 times 
higher than the titer obtained with a higher dose of particles from pRm23j63 containing ceils which do not 
^ contain the preS2 sequence. PreS determinants enhance the response to the S determinants. This enhancing 
effect is not shown if the preS2 synthetic peptide Is Injected together with particles which lack the preS2. 
Moreover, the preS2 synthetic peptide alone is unable to elicit anti-preS2 antibodies. 

This Indicates that the association of preS2 determinants and S detemnlnants on the same particle Is 
required to obtain an antH>reS2 response. Furttiemnore this association results In an improved anti-S 
response. 

The results of the immunogenicity studies referred to herein related to the PreS2^ particle are summarized 
In Table VIII. 
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Table VII I, Immuno oeniclty of the DreS2-S partieia 



Titre of 

Preparation Dose of S Anti-3 antibodies Anti--preS2 
injected related (inIU/ml)(2) antibodies (3) 

antigen 

(yg)(l) Balb/c NMRl . Balb/c NMRl 



Yeast-derived 1.7 1404 719 -0 2 

S particle 
directed by 
PRIT12363 

Yeast-derived 1.S5 748 1163 0 4 

S particle 

directed by 

pRIT12363 

+ synthetic 

preS2 peptide: 

0.08 iig 

Synthetic - 2 10 4 

preS2 peptide: 
0.08 pg 

Yeast-derived 0.3 3344 2418 8 120 

(preS2-S) 

particle 

directed by 

PRIX12660 



(1) determined by using the AUSRIA Kit (Abbott Labs) and the Proposed International Reference 
Preparation supplied by Dr. Schlkj, National Institute of Biological Standardization. London, as standard. 
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(2) determined by using the AUSAB kit (At>bott Labs) and the Worid Health Organization standard. 

(3) detennlned tiy using a soBd phase radio hrnmuno assay and synthetic preS2 peptide adsort>ed to 
plastic. 

The titer Is the dilution which gives a significant binding of the antibody (2^ times the value obtained with a 
S negative control serum). 

EXAMPi-ES OFTHE EXPRESSION OF THE EMTIRE PreS1-PreS2-S PROTEIN COPINQ SEQUENCE IN YEAST 

Example 31. Construction of plasmid pRm2845 that expresses the preS1-prBS2-proteln In yeast 

10 The first step was the fusion of the TDH3 promoter region with the pre SI N-termlnal region from the HBV 
genome. The starting material was plasmid pRrT12792 that harbors a 495 bp Ncoi-Xbai fragment containing 
tiie pre Sl-Pre S2 coding sequence except for the last asparagine codon. The Nco l site oveilaps the ATQ 
oodon of amino add residue -1 63 of the HBV pre SI sequence. The sequence of the Ncoi-Xbal fragment is 
shown in Rgure 4A. 

IS An essentially slmflar plasmid to pRn"12792, identified as pRrT12793 (5840 bp), was deposited In the 
American Type Culture Collection. RockvOle. Maryland, U.SA., In accordance witii the regulations of the 
BudapestTreaty under accession number ATCC 67193 on Septembers, 1986. From plasmid pRm2793,a4S5 
bp Ncol-Xbal fragment can be recovered which contains a complete pro Sl-pre S2 coding sequence. This 
fragment differs from that shown In Rgure 4A by one nucleotide having the sequence ACGAACTAATAATC- 

20 TAQA at the 3^ end of the fragment The nucleotide difference Is underscored. In the following examples, 
pRm2792 can be replaced witi) pRrn2793. Botti the Ncol-Xbai fragments contained pRrri2792 and 
pRin2793 were derived by In^tre inanlpuIaUon of DNAfrom ttie PreS1-PreS2 region of the DNA of Hepatitis B 
virus strain (serofype adwO cloned on plasmid pRin0616. Plasmid pRIT10616 Is available from the American 
TVpe Culture CoReetion under accession numt)6r ATCC 39131. 

2S 52 |ig of pRin2792 were digested with the Nco l and Xbal restriction enzymes and treated with Mung Bean 
nuclease In order to eliminate the 5' extensions. 600 ng (2 pmd) of the treated and purified 495 bp NoolXbal 
fragment were ligated to 170 ng (0.02 pmol) of pRrn2314 vector that had been previously opened by BamHI 
and Smal enzymes and also was treated with mung bean nuclease. pRm2314 contains the entire 2 micron 
fragment from S. cerevlslae and an expression cassette in which the TDH3 promoter Is separated from the 

30 ARG3 terrrUnator by a BamHI-Smal-EcoRI Gnker: 



BanlH I EcoRI 

35 



Smal 



^ AT66ATCCCCGGGAATTC .... 

pTDHj tARG 



A more detailed description of the construction of pRrT12314 is found In Example 10(B) above. The ligation 
mix descrit>ed above was used to transform E. toO MM294 to ampldllin resistance according to tfie method of 
Cohen et al. cited above. Six transfbrmants containing a plasmid in which the pre Sl-pre S2 fragment was 
Inserted in the correct orientation were retained. These were the plasmtds pRrn2843 (a....f) (Figure 18). 

50 The second step was the Insertion Into each plasmid pRm2843 (a.....f) described above, of the S coding 
region to reconstitute the entire pre S1-S2-^ gene. 

This was done as fbilowa: plasnrdds pRm2843 (a.....f) were each digested with BamHI and Sail enzymes. The 
BamHI site Is placed at the beginning of the pre S2 region, the Sail site Is placed behind the ARG3 tennlnator in 
pBR 327. 80 ng (0.012 ^ol) of the largest BamHI-Sail fragment (10,400 bp) purified from each plasmid 

55 pRm2843 (a.....f) were ligated to 300 ng (0.11 pmol) of the small BamHI-Sall fragment from pRrn2660 
(Example 16). The small (4800 bp) BamHI-Sal l fragment of pRm2660 contains part of the pre S2-S coding 
sequence foDowed by the ARQ terminator sequences and the LEU2 yeast sequence. After ligation and 
transformation of E. coli MM294 witti each of plasmids pRIT12843 (a.....f) according to the method of Cohen et 
al., dted above, a series of ampldllin resistant clones containing the plasmids named pRrn2845 (a....f) was 

SO obtained. Figure 18 is a flow sheet Qlustrating preparation of pRIT12845 (a.....f). These plasmids pRm2845 
(a....f), yNBVB used to Individually transform the S. cerevisiae strain 10S44C dr* according to the method of tto 
et ai.. dted above. The screening of cultures grown from Individual dones from the yeast transformations was 
done by testing crude cells extracts in the commerdally available AUSRIA test (Abbott). Crude ceU extracts 
were made by disrupting yeast cells resuspended In PBS, containii^ QJSfih T>ween 20, 2 mM PMSF and 5% 
isopropanol. One transformant of each of the six separate transformations [with pRrn2845 (&.«.f)] was 
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tested. Five out of six tested transformants gave a positive AU8R1A. 

To test whettier a protein of the correct size gave the S-related antlgenldty detected by the AUSRIA. sam 
pies of the crude extracts described above, were subjected to the Immunoblot procedure described In 
example 20. Four of the five AUSRIA positive transformants showed a S-reiated protein of about 39K. One 
transformant (extract a) showed a S related protein of 23K. Retained as pRm:iB45 was the yeast tranformant 5 
expressing the 39K protein present In extract (e). See, Figure 18. 

Example 32. The preS1-preS2-S protein is assembled Into particles carrying a receptor for polymerized human 
serum albumin and a preS1 epitope on their surface. 

Crude cell extracts from yeast strain 1(»44C dr* harboring either pRrri2846. pRm2660 or pRrT12377 and 10 
yeast strain DCS cir* hariE>oring pRrn2383 were prepared as described In Exarnpie 19. Total protein 
concentration. HBsAg related antigenicity and detection of a receptor for polymerized, human serum albumin 
were performed as described in Example 19. The result showed a strong binding activity for polymarlsed 
human serum albumin present In extracts from strain 10S44C cir' containing pRrn2843 and pRrT12660 (see 
also Example 10) which is absent in extracts from strain 10S44C dr** harboring pRm2377 or strain DCS cir' is 
harboring pRrn2363. 

Csa equilibrium centrifugation of crude cell extracts from 10S44C dr^ cells containing pRIT12i34S 
(described In Example 12) and measurement of HBsAg related antigenicity by AUSRIA assay in the CsCI 
fractions, showed the antigen to band around rtio - 12 g/cm^. This was confirmed by Immunoblot analysis of 
the Csa fractions. Pre SI antibody binding acth% In the CsQ fractions was tested by the EUSA assiy SO 
described in Example 12. After aflowing the antigen to bind to the soDd-phase anti-HBsAg. the bound antigen 
was incubated with monoclonal antibody MA18/7 (Example 31) which was then detected by blotinylated 
anti-mouse Ig. strsptavldln-biotinylated horseradish peroddase oom- plex and chromogen as described In 
Example 12. 

The pre SI specific antibody binding actNity was found to ooequUibrate with the HBsAg related antigenicity 2S 
In the Csa gradient TTiese results show that the pre Sl-pre S2-S protein produced bi 10S44C dr" cells 
harboring pRIT 1284S Is assembled into particles lOce the 22 nm HBsAg partldes derived from serum. The 
partides expose pre SI specific sequences as weD as a receptor for poiymerised human senim albumin on 
their surface. From the level of activity of those partldes In the AUSRIA test and the intensity of the reaction of 
the pre SI -pre 82-8 protein In an Immunoblot analysis, it can be conduded that the partlde's 8 determinants so 
are partially obstructed or deformed by the presence of pre S sequences In tiie partlde. 

Example 33. Two proteins of molecular weight of about 38 and 45 kilodaltons (IcD). both carrying preSI. preS2 
and S epitopes are expressed In yeast transformed with RrT12845 

HBsAg related protein monomera expressed in pRm2845 containing cells were analysed as described In as 
Example 20. 

Migration and Immune reactivity of the cellular proteins were compared to the proteins from HBsAg 
partides (serotype ad) purified from human serum (obtained from Dr. W.H. Qeriich. D^MUlment of Medical 
MIcrobidogy. University of QOtttngen, Federal Republic of Germany). 

Three immunoreagents were used to analyse the proteins: 40 

- the monoclonal antibody (obtained from H. Thomas -Example 20) recognizing a S epitope 

- a polyvalent rabbit antiserum raised against the synthetic pre 82 peptide (described in Example 23) 

- monodonal antibody MA18/7 specific fbr a pre 81 epitope as described by Heenrann et al. (J. Virol., 52, 
396-402, 1984). — 

The immunoblot showed, among the cellular proteins derived from pRIT1284S containing 10S44C cells, the 4S 
presence of two proteins reacting specifically with all tfiree Immunoreagents described above. The mdecular 
weight of these two proteins was estimated to be about 38.000 and 45,000 daltona. The mdecular weight 
estimation was based on ttie migration of the protdns from serum derived HBsAg as defined by Heemian et al. 
(J. VIrd., 52, 396-402. 1984). The 38,000 dalton pre Slijre S2-S protein directed by pRm2B46 migrates silghtiy 
faster ttian the p39 pre Sl-pre S2-S protein derived from HBsAg partides. The p39 protein from human senjm SO 
partides of serotype ad presumably has a pre 81 region of 119 amino adds (Heermann et al., J. VIrd., 52, 
396-402. 1984). pRrri2845 codes for a pre 81 region of 108 amino adds. 

The above results show that yeast cdls hart)oring pRIT12845 express two proteins of about 38 and 45 kD 
canying pre 81, pre 82 as well as 8 epitopes. 

ss 

Example 34. The 45 Kd preSI -pre82-8 protein spedes canles a N-glycosldlcally bound high-tnannose type 
digosaccharide chain 

Cells hariDoring pRrT12845 were disrupted witti an equal weight of giass beads (diameter 0.45-0.50 mm) In 
an equal weight of 10 mM sodium phosphate buffer (pH 7.4) containing 2% SDS and 8 mM EDTA by agitation 
wrth a Vortex mixer. ^ 

After centrifugation. the supernatant was heated for 4 minutes at KX)^. 

10 of tile heated supernatant fluid was diluted witii 40 |il 25 mM sodium dtrate buffer (pH S.0), and 2 )U 
EndoH (74 jig/ml) (see. Example 27) was added. Samples wRh and wittiout added EndoH were Incu- bated for 
2.5 hre at 37**. 

The EndoH treated and untreated samples were analysed by SDS-pdyacrylamide gd electrophoresis aftd es 
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immunoblotiing as described in Bcample 20. 

Tlie blot was analysed with monoclonal antibody 6 (S specific) and MA18/7 (pre S1 specific) (see. Example 
33). 

The Immunoblot showed both with monoclonal 6 and monoclonal MA18/7, the disappearance of the 45 kD 
5 band and the abearance of a41 kD band upon Endo H treatment The migration of the 38 kD t>and was not 
affected by the Endo H treatment 

These results show that the 45 kD pre S1-pre S2-S protein species carries a N-glycosidlcally bound 
high-mannose type oligosaccharide chain. 

10 Example 35. The 38kD preS1-preS2-8 protein contains covalently bound myristto acM present in an amide 
linkage 

Cellsfrom yeast strain 10S44Ccli^ harboring either pRrri2845,pRrn2660 or pRrn2377 were grown In YNB 
to an 0D620nm of 0.4. 

Cells (20 ml culture) were labeled for 60 min with 1 mCi of 3H-iabeIed myristic acid (New England Nuclear) 
IS and then collected by centrifugation. Cells were washed with cold H2O, resuspended In 100 ^ 5 mM tris-HCI 
(pH 7A) containing 3 mM (DTT, 1<Vb SDS and 1 mM PMSF. and broken with 100 mg glass beeds by six 
30-second spurts of vigorous mbdng In a Vortex, cooling on ice between each mixing. Debris was removed by 
centrifugation for 1 minute In a Eppendorf 5414 centrifuge fTowler and Giaser, Proa Natl. Acad ScL, 83, 
2812-2816 J986). 20 |J of the extract was treated for 3^ hre at room temperatiro with 7 |U 
20 M hydroxylamine. 20 mM glycine. pH 10. 

Hydrracylamlne treated and untreated samples were subjected to SDS poiyacrylamide gel electrophoresis 
and fhJorography as described in Example 26. The hydroxylamine untreated samples were also subjected to 
immunoblot ana^sls with the monockmal antibody recognizing a S epitope, as described in Example 20. 
The immunobtot showed results identical to those described in Examples 20 and 33. 
25 The fluorograph showed a labeled band of 38kD present only In the cellular extract from cells hart>oring 
pRIT12845, which was hydroxylamine-stable. Indicating that the 3H-myristate label Is present In an amide 
linkage. No lat>eled bands specific for ttie extracts from cells hart>oring pRfT 12660 were detected. 

These results show that the 38kD pre S1-S2-S protein contains covalently t>ound myristk: acid present in an 
amide linkage. In general, myristate t>ecomes covalently t>ound to proteins via amide iinicage to an 
30 amino-terminal glycine. This suggests that the initiator Met has been removed from the pre Sl-pre S2-S 
protein and that the Qly residue at position 2 has become available for acylatlon with myristic acid. 

Example 36. Use of preSI vectore for insertion of functional DNA coding sequences 
The starting material is DNA of plasmld pRIT 12793 described in Example 31 above. pRm2793 contains the 

35 pre Sl-pre S2 coding sequence on a 495 bp Ncot-Xbal fragment DNA of plasmid pRIT12793 is digested with 
Ncol restriction endonudease. treated with 74 DNA polymerase to fill In the sticky ends and digested with Xbal 
endonudease. The Ncol/T4 DNA polymerase/Xbal fragment is purified by poiyacrylamide gel electrophoresis 
and electroelutkm and Inserted on vector pUC12 previously digested with Smal and Xbal endonucieases. The 
cloning vector pUC12 Is commercially avaRable from Amersham (Uttle Chaifont, Bucks, United Kingdom) and 

40 from Pliarmacia (Piscataway. NJ). The plasmki pRfTX is obtained as shown In Rgure 5B. In the pre Sl-pre S2 
region of plasmki pRITX, a single Nco l site overtegss the ATQ start codon of the pre SI region, and a Xbal site Is 
present adjacent to and downstream of the TAA stop codon at the C-teimlnus of the pre-S2 coding 
sequences. A single BstX restriction site is present on the plasmk) dose to the N-tenninal end of the pre SI 
coding region. 

45 It will be appreciated one skilled In the art that the vector pRITX or similar derivative vectore containbig 
the pre S1-S2 sequences may be used for introduction of further functtonal DNA coding sequences encoding 
peptides of interest to create In phase lUskms within the pre SI region by insertton at the BstX site. 

Furthermore, functional DNA codng sequences may be introduced by digestion of pRFTX and derivatives 
with combinations of BstX and BamHI or EcoRI or endonucieases and insertion of the functtonal DNA 

SO coding sequence between these sites, thus removing the greater part of tfie pre SI coding regton and the 
N-termlnal part of the pre S2 sequence. Such fusions will be of the type pre Si-introduced functional DNA 
coding sequence pre S2. Once such fustons are made, they can be combined onto other ptasmkls containing 
the rest of the pre Sl-pre S2 sequences and sequences necessary for maintenance and repllcatk>n in E. coil 
and S. cerevisiae as exemplified in Example 16A. 

55 

Example 37. Fusion of peptide from proteins of HTLV-lll virus to the preS2 region of preS2-^ sequences to 
create novel hybrid particles 

The preS2 region of a HBV preS2-S coding sequence can also be used for the insertion or fusion of peptide 
sequences of other viral antigens Including those of the causative agent of human AIDS ^ndrome, the 

60 retrovims known as HIV. HTLV-lli or LAV. 

The doned genome of a HIV vims has been described by Ratner L et al (Nature. 313. 277-284, 1985) and 
Shaw G. et al. (Science. 1165-1171. 1984). From this genomic done various subfragments can be 
obtained as functional DNA coding sequences which correspond to peptide regions of Interest for fusion to 
tiie preS2-S sequence to form novel hybrid particles containing epitopes of the HIV virus proteins. Such hybrid 

€5 particles can serve as tiie basis of a human vaccine for protection against the Infective agent HIV. 
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Among the peptide regions of Intere^ are the following: 
a C7 peptide. 

This region con^sponds to a stretch of 45 amino acid residues Immediately adjacent to the processing site 
of the envelope precursor and was defined by Stardch B.R. et ai. (Cell, 45, 637-648, 1986). s 

b. Peptide 121. 

TOs region con^sponds to a relatively conserved amino acid sequence present In several retroviral 
transmembrane envelope proteins (see Ciandoio Q J. et aL (Science, 468^, 1985). This sequence is 
thought to participate In the pathogenesis of retroviraHnduced Immunosuppression (Snyder man R. and 10 
Clanciolo QJ.. ImmunoL Today, 6. 240, 1984). 

c. 'Pressman peptide*. 

A synthetic peptide conrespondlng to amino add sequence 735 to 752 of the precureor envelope 
glycoprotein of HIV was used by Kennedy R.C. et al. (Science. 1556-1659, 1986) to immunize rabbits and 15 
rabbit antiserum was used to Investigate the recognition of HfV envelope proteins by induced antibodlea The 
results suggest that this region could Induce an Immune response directed against the native glycoprotein. 

A. Fusion of the DMA sequence connesponcfing to the C7 peptide from a HTLV-lll isolate to the or^SSrS reolon 

on pRfT10911. ^ 

The startihg material was plasmid pBH10-R2 containing the cloned genome of HIV isotete BH10. This 
plasmid was obtained from R.C. Gatlo. National Institutes of Health, Bethesda. i^D. A 3108 bp Sall-Xhol 
fragment of DNA of viral origin which was exdsod from pBH10-R2 and cloned between the Sail and )Oiol sit^ 
of PUC18 to create plasmid pRrT12901. A restriction map of this Sali-Xhoi fragment is shown in Figure IB. 
Plasmid puas Is described by Nonwider J. et aL (Qene. 26, 101-106. 19^ and Is commercially available from 
Phamiada Inc., Piscataway N J. 

To create the fusion. DNA of pRm2901 was digested with Bglli and Mboll endonucleases. and the 117 bp 
fragment Isolated by acrytamlde gel electrophoresis and electroelution (see, Rgure IB). DNA of this fta^ent 
was treated with Mung Bean nuclease tp remove single stranded extensions and Ogated with DNA of 
pRiTI 091 1 (see. Example 5. above) which had been digested with BamHI endonudease and treated with Mung 
Bean nuclease. From this ligation mixture a plasmid. pRrn2B93. was isolated in wWch the 117 bp BgllNMboll 
fragment from pRiT12901 has been Inserted on pRfT10911. The nucleotide sequence across the ftSon 
Junction between the TDH3 promoter and the 117 bp HTLV-lil Bglll-Mboii. mung bean nuclease treated 
fra^ent Insert on pRrn2893 was determined and it was found that 10 nucleotides In excess had been 
removed from the Bglli end of tfie fragment. The sequence found was ss 

S * ATGGAGGAGGAGATATGAGQGA 3 ' 

wheretheflrstATQcodonunderscoredlsthatoftheTDHSpromoterregionfra^ ^ 
an ATQ codon internal to the 07 peptide fragment. The second ATQ condon overtaps a TQA tanninatlon 
codon. This temdnatton codon Is in the same reading frame as the first ATQ codon. DNA sequence 
determination of the end of the fusion region on pRiT12893 showed the correct sequence for In phase fusion 
of the Mboll-mung bean nuclease treated end of the HTLV-lll fragment to the preS2 sequence on pRITIOOl 1 as 
shown In Figure 2B. 45 

DNA of pRrT12893 was then digested with HIndl ll endonuclease and the 0250 bp fragment isolated and 
purified by agarose gel electrophoresis and electroelution. The 3250 bp HIndlll fraginert was then inserted at 
the HIndlll site of plasmid YEp13 to create pRm2894. DNA of the plasmidwas then used to transform ceRs of 
yeast strains DCS and 10S44C to leucine independence by the method of Ito et aL as dted in Onmple 10 
at>ove. 50 

Transcription and translation In yeast ceOs of the C7-preS2-8 fusion DNA on pRm2894 wOi result In 
synthesis of a small 5 residue peptide commencing at the TDH3 ATQ codon and tarmlnaUng at the TQA codon 
imdmscored above and of the C7-preS2^ fusion peptide commencing at the second ATG codon internal to 
the C7 sequence. This fusion contains 38 amino add residues from the C7 peptide In ptece of the 45 residues 
from the intact Bgiii -Mbol l-mung bean nuclease treated C7 fragment It will be appreciated that other plasmlds ^ 
recovered from the same ligation mixture as pRrT12893 can be examined and sequenced to detemihe If they 
cany a fusion corresponding to the complete fragment 



B. Fusion of the DNA sequence corresponding to the Peptide 121 to the preS2-8 region on pRIT10911. 

To create the fusion. DNA of pRrT12901 (described In Part A. above) was digested %vtth Hhal and HIndlll 
endonucleases to Hberate the 230 bp fragment shown m Figure 1 B. This fregmefit was purified^ aciyiimide 
gel electrophoresis and electroelution and treated with T4 DNA polymeraee to fiH in single strand extenelons. 
The so treated fragment was Dgatsd with DNA of pRfTIOOII which had been digested with BamHI and Mung 
Bean nucieass. From this ligation mixture, a plasmid was identified as pRm2897 on which the^bp fragment 
of HIV DNA has been inserted in the correct reading frame of fusion to the prBS2 sequence of pRHOOII as 



49 



25 



SO 



60 



65 



0278940 

shown in Figure 3B. DMA of pteismld pRFTI 2897 was then digested with Htndlli endonuclease. and the 3365 bp 
fragment containing the TDiH3 promoter, HlV-preS2-S fusion and the AR G3 terminator was purified by agarose 
gei electrophoresis and electroelutfon. This 3365 bp fragment was Ilgated onto YEp13 digested with Hindlll 
endonuclease to form plasmid pRiT12898. DNA of this plasmid (pRiT 12898) was then used to transform cells 
5 of yeast strains DCS and 10S44C to leucine Independence by the method of Ito et al. as dted in Example 10, 
above. 

C. Fusion of the DNA sequence corresponding to the 'Dreesman'peptide to the preS2 region on pRm091 1 . 
A 60 bp fragment of synthetic DHA with the sequence shown below was synthesized by conventional means 
10 as two single strands which were then annealed together: 

5' 6ATCTCGACA6GCCC6AAGGAATA6AA6AAGAAGGTGGA6AGA6A 3* 
3 • A6CTGTCCGGGCTTCCTTATCTTCTTCTTCCACCTCTCTCT 5 • 

IS 



20 



5* GACA6AGACAGATCCCC6 3> 
3' CTGTCTCTGTCTAGGGGCCTAC 5' 



This fragment has 5'^ Bgill and 3" BamHI single strand extensions. About 200 ng of a double stranded 
^ fragment and 300 ng of DNA of plasmid pRIT10911 digested with BamHI were mixed and ligated. From this 
ligation mixture, a plasmid. pRrri2899. was Identffled and recovered on which the 60 bp fragment has been 
inserted at the BamHI site of pRITIOOII so as to create an inframe fusion as shown In Figure 4B. DNA of 
pRlT12899 was digested with Hindl ll endonuclease, and the 3200 bp Hindl ll foment was purified by agarose 
gel electrophoresis and electroelutlon. This Hindl ll fragment was inserted at the Hindl ll site of YEp13 to create 
^ plasmid pRm2900. DNA of plasmid pRm2900 was then used to transfbnm ceils of yeast strains DCS and 
10S44C to leucine independence tyy the method of Ito et al. dted in Example 10, above. 

Example 38. Expression of fusion proteins from pRiR12894 and pRm2898 
Yeast strains DCS or 10S44C hart)orlng either YEp13. pRm2363, pRiT10912, pRrri2894 or pRIT12898 were 

^ separately grown in liquid medium lacking leucine (YNB + 80 |ig/ml hlstidine or YNB). and cellular proteins 
were analyzed t^y the Inrununobiot procedure as described in Example 11 using as the first antIt>ody a 
monoclonal antibody directed ag^nst a denaturation and reduction resistant epitope of human derived HBsAg 
(monoclonal 6 obtained from H. Thomas, Royal Free Hospital, London, England). pRin0912 is described in 
Example 5 above. pRrn2894 and pRm2888 are described in Example 37, above. pRm2363 canies the S 

^ gene off HBVfUsed to the TDH3 promoter as the 2900 bp Hindlll fragment from pRrri2322 (Example 8) Inserted 
on ayeast vector plasmid Identical to pRm2377 (Example 10. part B above) and produces HBsAg under the 
control of the TDH3 promoter. 

The immunoblot showed a protein band related to S protein of estimated molecular weight of about 32 kD 
among the total cellular proteins from yeast strains DCS or 10S44C transformed with pRm2894. A protein 

^ band related to S protein of estimated molecular weight of about 4S kD was present among the total cellular 
proteins from yeast strains DCS or 10S44C harboring pRm2898, while a protein band related to S protein of 
estimated molecular weight of about 29 kD was present among the total celluiar proteins from yeast strain DCS 
harboring pRm0912. 

Crude cell extracts from yeast strain DCS harboring either pRm2363. pRm6912, or pRn'12894 were 
^ prepared as described in Example 1Z Protein concentration and AUSRIA activity ware measured as described 
in Example 12. Immunoblot analysis of these extracts was performed as described above. 

Csa equilibrium centrifugatk)n of the cnide extracts, (described in Example 12) and measurement of 
HBsAg related antigenicity by AUSRIA asssy In the CsCI fractk)n8, showed the antigen to band around 
rtio » 1.2 g/cm^. This was confinned by immunobtot analysis of the Csd fractions. 
^ These results show that the 32 kD fusion protein produced In yeast strain DCS cells carrying pRm2894 is 
assembled Into partides Hke the 22 nm HBsAg particles derived from human serum. From the level of activity 
of those particles In the AUSRIA test, and the intensity of the reaction of the 32 kD protein in the immunobtot 
analysis, it can be concluded that the particles* S determinants are partially obstructed or deformed by the 
presence of C7 sequences. 

^ From these results it is concluded that yeast ceils of strains DCS and 10S44C transformed with either 
pRrn2894 or pRrri2898 specifically synthesize a fusion protein of molecular weight of about 32KD or 45 kD 
respectively, both carrying a S epitope. 

It wUI be appreciated by one of skill In the art tiiat hybrid parttoies canylhg the HIV vims peptide epitopes can 
be extracted and purified tiy conventkinal means firom cultures of yeast cells transformed with pRn'12894, 

^ pRin2898 and pRnri2900. Such purified particles may then be formulated in tiie presence of adjuvants such 



SO 



0278940 

as AI(0H)3 or AIPO4 as vaccines for human use. The prefanBd dose wfll be In the range of 1-1000 |ig protein 
and is determined by conventionai trials. It will also be understood that the invention as exempttfled herein 
Includes other peptide regions from HIV viral proteins which can be fused to the prBS2-S region of HBV coding 
sequences to form hybrid partlcies presenting the HiV epitope of interest. 



aalms 

1. A recombinant DMA molecule comprising a functional DNA coding sequence fused, in phase, to a 10 
portion of the Pre S2 region of a HBV Pre S2-S protein coding sequence wherein the functional DNA 
coding sequence comprises the PreS2 coding sequence, PreSI coding sequence or entire PreS1-PreS2 
coding sequence, the drcumsporozoite protein coding sequence of Piasmodlum. or a HiV coding 
sequence such as a HIV envelope gene sequence such as the HIV C7 protein coding region, the HIV 
Peptide 121 coding region orthe HIV Dreesman peptide coding region. fg 

2. A recombinant vector which comprises a recombinant DNA molecule oper«tively linked to a 
regulatory region, wherein the DNA molecule contains a functional DNA coding sequence fused, in phase, 
to a portion of the Pre 82 region of a HBV Pre 82-^ protein coding sequence wherein the regulatory 
region comprises the TDH3 promoter. 

3. A recombinant vector which comprises a recom binant DNA molecule operatively linked to a 20 
regulatory region, wherein the DNA molecule contains a functional DNA coding sequence fused, In phase, 

to a portion of the pre S2 regton of a HBV Pre S2-S protein coding sequence wherein the functional DNA 
coding sequence comprises the PreS2 coding sequence, PreSI coding sequence or entire PreS1-PreS2 
coding sequence, cireumsporozolte (CS) protein coding sequence of Piasmodlum . or a HIV coding 
sequence such as a HIV envek)pe gene sequence such as the HIV C7 protein, the HIV Peptkte 121 protein 26 
or the HIV Dreesman peptkJe. 

4. The vector of Claim 3 which comprises a 192 bp SauSA fragment coding for 16 tetrapeptlde repeats 
of the CS protein of P.f^tfcipanim wherein such fragment Is one which could be derived from Sau3A 
digestion of a 1215 bp SttJl - Rsai fragment of piasmid WR201 containing the P. fatelparum CS protein 
coding sequence minus the first 52 bp of such sequence. go 

5. The vector of Claim 4 which comprises additional Sau3A fragments of CS protein DNA. 

6. A recombinant vector which comprises a recombinant DNA molecule operatively linked to a 
regulatory region, wherein the DNA molecule contains afunctional DNA coding sequence fused, in phase, 
to a portion of the Pre S2 region of a HBV Pre S2-S protein coding sequence wherein the functtonal DNA 
coding sequence comprises all or part of the Pre S2 protein coding sequence. 3$ 

7. The vector of Qaim 6 wherein thjS functional DNA coding sequence comprises the tbUowing amino 
acid coding sequence: 



PRE-S2 REGION 



GCT CTG CAG GAT CCC AGA GTC AGG GGT CTG 
Ala Leu Gin Asp Pro Arg Val Arg Gly Leu 

TAT TTT CCT GCT GGT GGC TCC AGT TCA GGA 
Tyr Phe Pro Ala Gly Gly Set Ser Ser Gly 



51 



40 



-55 

ATG CAG TGG AAT TCC ACT GCC TTC CAC CAA 

Met Gin Trp Asn Ser Thr Ala Phe His Gin ^ 



50 



66 



60 
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ACA GTA AAC CCT GCT CCG AAT ATT GCC TCT 
Thr Val Asn Pro Ala Pro Asn He Ala Ser 

CAC ATA TCG TCA AGC TCC GC6 AGG ACT GGG 
His He ser Ser Ser Ser Ala Arg Thr Gly 

GAC CCT GTG ACG AAC 
Asp Pro Val Thr Asn* 

8. The vector of ClaJm 8 wherein the functional DMA coding sequence comprises all or part of the Pre 
S1-PreS2 protein coding sequence. 

9. The vector of Claim 8 wherein the functional DNA coding sequence comprises the following amino 
add coding sequence: 

PRE-Sl REGION 
-163 

ATQ GGG ACG AAT CTT TCT GTT CCC AAC CCT CTG , 
Met Gly Thr Asn Leu Ser Val Pro Asn Pro Leu 

GGA TTC TTT CCC GAT CAT CAG TTG GAC CCT 
Gly Phe Phe Pro Asp His Gin Leu Asp Pro 

GGA TTC GGA GCC AAC TCA AAC AAT CCA GAT 
Ala Phe Gly Ala Asn Sec Asn Asn Pro Asp 

TGG GAC TTC AAC CCC ATC AAG GAC CAC TGG 
Tcp Asp Phe Asn Pro lie Lys Asp His Trp 

CCA GCA GCC AAC CAG GTA GGA GTG GGA GCA 
Pro Ala Ala Asn Gin Val Gly Val Gly Ala 
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TTC G6G CCA 666 CTC ACC CCT CCA CAC 66C 
Phe 61y Pro 61y Leu Th£ Pco Pro His 61y 

GGT ATT TTG GGG TGG AGC 'CCT GAG GOT GAG 
Gly lie Leu Gly Trp Ser Pro Glu Ala Gin 

GGC ATA TTG ACC ACA GTG TCA ACA ATT CCT 
Gly He Leu Thr Thr Val Ser Thr He Pro 

CCT CCT 6CC TCC ACC AAT CGG CAG TCA GGA 
Pro Pro Ala Ser Thr Aen Arg Gin Ser Gly 

A6G CAG CCT ACT CCC ATC TCT CCA CCT CTA 
Arg Gin Pro Thr Pro He Ser Pro Pro Leu 

AGA GAC AGT CAT CCT CAG GCC 
Arg Asp Ser His Pro Gin Ala 

PRE-S2 REGION 

ATG CAG TGG AAT TCC ACT GCC TTC CAC CAA 
Met Gin Trp Asn Ser Thr Ala Phe His Gin 

GCT CTG CAG GAT CCC AGA GTC AGG GGT CTG 
Ala Leu Gin Asp Pro Arg Val Acg Gly Leu 

TAT TTT CCT GCT GGT GGC TCC AGT TCA GGA 
Tyr Phe Pro Ala Gly Gly Sec Ser Ser Gly 

ACA GTA AAC CCT GCT CCG AAT ATT GCC TCT 
Thr Val Asn Pro Ala Pro Asn He Ala Ser 

CAC ATA TCG TCA AGC TCC GCG AGG ACT GGG 
His He Ser Ser Ser Ser Ala Arg Thr Gly 
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GAC CCT GT6 ACQ AAC 
Asp P£o Val Th£ Asn. 



10. A eukaryotlc host cell transformed wtth a recombinant vector, wherein said vector is according to 
Claims. 

11. ThehoslofClalmlOwhIchlsayBastceD. ^. _ ^ ^/ ^ ^ 
ia A method of preparing a hybrid particle containing HBsAg protein which contains and/or presents a 

peptide encoded by afunctional DNA coding sequence which comprises culturing a eukaiyotc host cell 
transfonned with a vector in appropriate culture media and IsolaHng the particle from acell lysate or 
extract of such hosf s culture, wherein said vector is according to Claim 5. 

13. ApartlclepreparedbythemethodofClalm1Z 

14. A hybrid Immunogenic particle comprising HBsAg protein wherein said particle contains and/or 
presents a peptide encoded afunctional DNA coding sequence. 

15. A vaccine comprising an Immunoprotectlve amount of a particle according to Claim 14. 

16. A micelle comprising the particle according to Claim 14 and polysoriMte. 

17. A vaccine comprising an Immunoprotectlve amount of a micelle according to Claim 1 6. 
18 AHBVPreSI protein coding region which codes for the fbOowing amino sequence: 
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-163 

AT6 G66 AC6 AAT CTT 
Met Gly Thr Asn Leu 

6GA TTC TTT CCC GAT 
Gly Phe Pbe Pco Asp 

GCA TTC GGA GCC AAC 
Ala Phe Gly Ala Asn 

TGG GAC TTC AAC CCC 
Tcp Asp Phe ASQ Pco 

CCA GCA GCC AAC CAG 
Pco Ala Ala Asn Gin 

TTC GGG CCA GG6 CTC 
Phe Gly Pro Gly Leu 

GGT ATT TTG GGG TGG 
Gly He Leu Gly Tcp 

GGC ATA TTG ACC ACA 
Gly He Leu Thr Thr 

CCT CCT GCC TCC ACC 
Pro Pro Ala Sec The 

AGG CAG CCT ACT CCC 
Arg Gin Pro The Pro 

AGA GAC AGT CAT CCT 
Acg Asp Sec His Pco 



TCT GTT CCC AAC CCT CTG 
Sec Val Pro Asn Pra Leu 

CAT CAG TTG GAC CCT 
His Gin Leu Asp Pro 

TCA AAC AAT CCA GAT 
Ser Asn Asn Pro Asp 

ATC AAG GAC CAC TGG 
He Lys Asp His Trp 

GTA GGA GTG GGA GCA 
Val Gly Val Gly Ala 

ACC CCT CCA CAC GGC 
Thr Pro Pro His Gly 

AGC CCT CAG GCT CAG 
Ser Pro Gin Ala Gin 

GTG TCA ACA ATT CCT 
Val Ser Thr He Pro 

AAT CGG CAG TCA GGA 
Asn Arg Gin Sec Gly 

ATC TCT CCA CCT CTA 
He Sec Pco Pro Leu 

CAG GCC 
Gin Ala 



19. A recombinant DNA vector comprising the Pre 31 protein coding region of Claim 18 operaUvoly 
linked to a regulatory region. 
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20. A transformed host cell comprising the Pre S1 proteh coding region of Claim 18. 

21. ProtelnencodedbythePreS1prote!ncodlngreglonofClaim18. ^ iwe-i 

22. A vaccine comprising an Immunoprotective amount of the protein encoded by the Pre Si protein 
coding redon of Claim 18. . eo 

23. A HBV Pre S2 protein coding region or a HBV Pre S2-S protein coding retf on wherein the Pre SZ 
portion thereof Godesfbr the following amino add sequence: 

Thr 

-55 -50 or , 

Met-Gln-Trp-Asn-Ser-Thr-Ala-Phe-His-Gln-Ala-Leu-Gln-Asp. 

.40 -30 
Pro-Arg-Val-Arg-Gly-Leu-Tyr-Phe-Pro-Ala-Gly-Gly-Ser-Ser- 

Ser- 

-20 

Gly-Thr-Val-Asn-Pro-Ala-Pro-Asn-Ile-Ala-Ser-His-Ile-Ser- 



-10 

Ser-Ser-Ser-Ala-Arg-Thr-Gly-Asp-Pco-Val-Thc-Asn 

24. A recombinant DMA vector comprising the Pre S2 or the Pre S2-S protein coding region of Claim 23 

operativelyBnkedtoaregulalory region. ^/>i-i,«oa 

A transformed host cell comprising the Pre S2 of the Pre S2^ protein coding region of CW^ 

26. Protein encoded by the Pre S2 or the Pre S2-6 protein coding region of Oalm 2a _^ ^ 

27. A vaccine comprising an Immunoprotective amount of protein encoded by the Pre S2 or the Pre S2-S 
protein coding region of Claim 23. 



29. A vaccine comprising an Immunoprotective amount of a particle encoded by the Pre S2-S proteoi 
coding region of Claim 23. 

30. A micelle containing the particle of Claim 28 and polysort>ate. 

31. AvacclnecomprisinganlmmunoprotectiveamountofthemlceiieofClaim30. ^ . „ _ 

32. A recombinant DMA vector comprising a Pre Sl-Pre S2-S protein coding sequence operatively llntod 
to a regulaloiy region wherein ttie Pre SI - Pre S2 portion off such sequences codes for the f 

amino add coding sequence: 

PRE-Sl REGION 
-163 

ATG GGG ACG AAT CTT TCT GTT CCC AAC CCT CTG 
Met Gly Thr Asn Leu Ser Val Pro Asn Pro Leu 
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GGA TTC TTT CCC GAT CAT CAO TT6 GAC OCT 
61y Phe Phe Pro Asp His Gin Leu Asp Pro 

6CA TTC 66A 6CC AAC TCA AAC AAT CCA GAT 
Ala Phe 6I7 Ala Asa Sec Asn Aan Pro Asp 

TGG GAC TTC AAC CCC ATC AAG GAC CAC TGG 
Tcp Asp Phe Asa Pco lie Lys Asp His Tcp 

CCA GCA GCC AAC CAG QTA GGA GTG GGA GCA 
Pco Ala Ala Asn Gin Val Gly Val Qly Ala 

TTC GGG CCA 6G6 CTC ACC CCT CCA CAC GGC 

Phe Gly Pco Gly Leu The Pco Pco His Gly 

GGT ATT TTG GGG TGG AGC CCT CAG GCT CAG 
Gly lie Leu Gly Tcp Sec Pco Gin Ala Gin 

GGC ATA TTG ACC ACA GTG TCA ACA ATT CCT 
Gly lie Leu The The Val Sec The lie Pco 

CCT CCT GCC TCC ACC AAT CGG CAG TCA GGA 
Pco Pco Ala Sec The Asn Acg Gin Sec Gly 

AGG CAG CCT ACT CCC ATC TCT CCA CCT CTA 
Acg Gin Pco The Pco lie Sec Pco Pco Leu 

A6A GAC AGT CAT CCT CAG GCC 
Acg Asp Sec His Pco Gin Ala 

PRE--S2 REGION 

-55 

ATG CAG TGG AAT TCC ACT GCC TTC CAC CAA 
Met Gin Tcp Asn Sec The Ala Phe His Gin 
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6CT CTG CAG GAT CCC 
Ala Leu Gin Asp Pro 

TAT TTT CCT GCT GGT 
Tyr Phe Pro Ala Gly 

ACA GTA AAC CCT GCT 
Th£ Val Asa Pco Ala 

CAC ATA TCG TCA AGC 
His lie Ser Sec Sec 
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AGA GTC AGG GGT CTG 
Arg Val Arg Gly Leu 

G6C TCC AGT TCA GGA 
Gly Sec Ser Ser Gly 

CCG AAT ATT GCC TCT 
Pco AsxL lie Ala Sec 

TCC GCG AGG ACT GGG 
Sec Ala Acg The Gly 



GAC CCT GTG ACG 2VAC 
Asp Pco Val Thr Asn 



33. A yeast host c»fl transfonned wnh the vector of Claim 

34. Protein preparad by the PrBS1-PreS2-S sequence of the vector of Claim 32. 

35. A vacdne comprising an bnmunoprotective anriount of the protein of dabn 34. 

36. A particle comprising protein encoded by a PreS1-PreS2-S protein coding region. 

37. A vaccine comprising an immunoprotectlve amount of the particle of Claim 36. 

38. A nrrfceBe comprising the particle of Claim 36 and polysorbate. 

39. A vaccine comprislngan Immunoprotectlve amount of the micelle of Claim 38. 

40. A Pre Sl-Pre S2 protein coding sequence which comprises the fblloMring amino acid sequence: 
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Pre SI Region 
-163 

ATG GGG ACG AAT CTT TCT GTT CCC AAC CCT CTG 

Met Gly The Asn L9U Set Val P£0 Asn Pro Leu 

GGA TTC TTT CCC GAT CAT CAG TTG GAC CCT 

Gly Phe Phe Pro Asp Hie Gin Leu Asp Pro 

6CA TTC GGA GCC AAC TCA AAC AAT CCA GAT 

Ala Phe Gly Ala Asn Ser Asn Asn Pro Asp 

TGG GAC TTC AAC CCC ATC AAG GAC CAC TGG 

Trp Asp Phe Asn Pro lie Lys Asp His Trp 



10 



IS 



20 



CCA GCA GCC AAC CAG G^A GGA GTG GGA GCA ^ 

Pro Ala Ala Asn Qln Val Gly Val Gly Ala 

TTC GGG CCA GGG CTC ACC CCT CCA CAC GGC 30 

Phe Gly Pro Gly leu Thr Pro Pco His Gly 

GGT ATT TTG GGG TGG AGC CCT CAG GCT CAG 3S 

Gly lie Leu Gly Trp Ser Pro Gin Ala Gin 

GGC ATA TTG ACC ACA GTG TCA ACA ATT CCT 40 

Gly lie Leu Thr Thr Val Sec The lie Pco 

CCT CCT GCC TCC ACC AAT CGG CAG TCA GGA 45 

Pro Pro Ala Sec The Asn Acg Gin Sec Gly 

AGG CAG CCT ACT CCC ATC TCT CCA CCT CTA » 

Arg Gin Pro The Pco lie Sec Pco Pco Leu 

65 
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AGA GAC AGT CAT CCT CAG GCC 
Arg Asp Sex His Pro Gin Ala 



10 



Pre S2 Region 
-55 

ATG CAG TGG AAT TCC ACT GCC TTC CAC 
Met Gin Trp Asn Ser Thr Ala Pbe His 



CAA 
Gin 



IS 



GCT CTG CAG GAT CCC AGA GTC AGG GGT CTG 
Ala Leu Gin Asp Pro Arg Val Arg Gly Leu 



TAT TTT CCT GCT GGT GGC TCC AGT TCA GGA 
Tyr Phe Pro Ala Gly Gly Ser Ser Ser Gly 



25 



ACA GTA AAC CCT GCT CCG AAT ATT GCC TCT 
Thr Val Asn Pro Ala Pro Asn lie Ala Ser 



CAC ATA TCG TCA AGC TCC GCG AGG ACT GGG 

His He Ser Ser Ser Ser Ala Arg Thr Gly 

GAC CCT GTG ACG AAC 

Asp Pro Val Thr Asn 



40 41. A recombinant DMA vector comprising the Pre S1-Pre S2 protein coding sequence of Claim 40 t 

operatlvely linked toaregulatory region. 

42. A yeast host cell transfonned with the vector of Claim 41 . 

43. Protein encoded by the Pre Sl-Pre S2 protein coding region of Claim 40. 

44. A vaccine comprising an Immunoprotectlve amount of protein encoded by the Pre S1-Pre S2 protein 
4S coding re^on of Claim 40. 

45. A recombinant DMA vector comprising a Pre SI functional. DMA coding sequence - Pre S2-S protein 
coding sequence operatlvely linked to an expression control sequence. 

46. A yeast host cell transforrned with the vector of Claim 45. 

47. Protein expressed by the host of Claim 46. 

SO 46. A vaccine comprising an Immunoprotectlve amount of the protein of Claim 47. 

Claims for the following contracting states : Es, Gr 

1 . A method of preparing a transformed eukaryotic host cell which comprises transforming a eukaryotic 
host cell with a vector which comprises a recombinant DNA molecule operatlvely linked to a regulatory 
SS region containing a functional DNA coding sequence fused* in phase» to a portion of the PreS2 region of a 

HBSf PrBS2-^ protein sequence wherein the functional DNA coding sequence also encodes the 
drcumsporozolte (CS) protein of Plasmodium, or a HIV envelope gene sequence such as the HIVC7 
protein, the HIV peptMe 121 protein or the HIV Dreesman peptide.. 
Z The method of aalm 1 wherein the host Is ayeast ceil. 
60 3. The method of Claim 2 wherein the host l)elongs to the genus Saccharomyces. 

4. The method of Claim 3 wherein the host belongs to the species Saccharomyces cerevislae or 
Saccharomyces cartsbergensis . 

5. The method of Claim 4 wherein the host Is strain DCS, DCS clr^, 10S44C or 10S44C or 10S44C dr*. 

6. A method of preparing a hybrid particle containing HBsAg protein which contains and/or presents a » 
6S peptkle encoded by a functional DNA coding sequence which comprises culturing a eukaryotic host cell 
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transformed wftK a vector In appropriate culture media and leolatlng the particle firom a cell iysate or 
extract of such host'e culture, wherHln said vector le a vector which comprtoes a recombinant DMA 
molecule operatlvely linked to a regutttfory region containing a functional DMA coding eequence fuaed, In 
phase, to a portion of the Pre&2 reglori of a HBV Pre S2-S protein sequence wherein the functional DMA 
coding sequence also encodes the drcumsporozolte (CS) protein of Plaemcyfium i or a HIV envelope 
gene sequence such as the HIVC7 prot&ln, the HIV peptide 121 protein brtheHIVDreesman peptide. 

7. A process of preparing a transformed host cell which comprises transfonmlng a host cell with the 
HBVPreSI protein coding region wMchccxlesfor the following amb^ 

-163 

ATO GGG ACQ AAT CTT TCT GTT CCC AAC CCT CTG 
Met 61y Thr Asn Leu Sec Val Pro Aen Pc6 Leu 

G6A TTC TTT CCC GAT CAT CAG TTG GAC CCT 
Gly Phe Phe Pco Asp His Gin Leu Asp Pco 

GCA TTC GGA GCC AAC TCA AAC AAT CCA GAT i 
Ala Phe Gly Ala Asa Ser Asn Asn Pco Asp 

TGG GAC TTC AAC CCC ATC AAG GAC GAC TGG 
Tcp Asp Phe Asn Pco lie Lys Asp His Tcp 

CCA GCA GCC AAC CAG 6TA GGA GTG GGA GCA 
Pco Ala Ala Asn Gin Val Gly Val Gly Ala 

TTC GGG CCA GGG CTC ACC CCT CCA CAC GGC 
Phe Gly Pco Gly Leu The Pco Pco His Gly 

GGT ATT TTG GGG TGG AGC CCT CAG GCT CAG 
Gly lie Leu Gly Tcp Sec Pco Gin Ala Gin 
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GGC ATA TTG ACC ACA GTG TCA ACA ATT CCT 
Gly lie Leu Thr Thr Val Set Thr He Pro 



CCT CCT 6CC TCC ACC AAT CGG CAG TCA GGA 
Pro Pro Ala Ser Thr Asa Arg Gin Ser Gly 



AGG CAG CCT ACT CCC ATC TCT CCA CCT CTA 
Arg Gin Pro Thr Pro He Ser Pro Pro Leu 



AGA 6AC AGT CAT CCT CAG GCC 
Arg Asp Ser His Pro Gin Ala 



8. The process of Claim 7 %vhereln the region is comprised by a recombinant DMA vector. 

9. The process of Claim 7 wherein the host Is ayeast cell. 

ia The process of Claim 9 wherein the host belongs to the genus Saccharomyces. 

11. A process fbr preparing the protein encoded by the Pre S1 protein coding region of Claim 7 which 
comprises culturbig a host ceD transformed with the region In appropriate culture media and isolating the 
protein from a ceOlysate or extract of such host's culture. 

1 Z A process of preparing a transformed host ceil which comprises transforming a host ceil with a HBV 
PreS2 protein oocBng region or a HBV Pre82-8 protein coding region wherein the PrBS2 portion thereof 
codes fbrthe following amino acid ref^nce: 

Thr 

-55 -50 or 

Met-Gln-Trp-Asn-Ser-Thr-Ala-Phe-His-Gln-Ala^Leu-Gln-Asp- 

-40 -30 
Pro-Arg-Val-Arg-Gly-Leu-Tyr-PHe^Pro-Ala^Gly-Gly-Ser-Ser- 

Ser- 

-20 

Gly-Tlir-Val-Asn-Pro-Ala-Pro-Asn-Ile-Ala-Ser-His-lle-Ser- 



-10 

Ser-Ser-Ser-Ala-Arg-Thr-Gly-Asp-Pro-Val-Thr-Asn 



13. The process of aalm 12 wherein the region is comprised by a recombinant DMA vector. 

14. A process for preparing protein encoded by the Pre S2 or the Pre S2-S protein coding region of 
Claim 12 which comprises culturing a host cell transfomied wfth the region In appropriate culture media 
and isolating the protein from a cell tysate or extract of such host's culture. 

15. A process for preparing a particle encoded by the Pre S2-S protein coding region of Qalm 12 which 
comprises culturing a host cell transfomied with the region In appropriate culture media and isolating the 
protein from a ceD lysate or extract of such host's culture. 

16. A method of preparing a transformed yeast host cell wWch comprises transfomning a yeast host cell 
with a recombinant DNA vector comprising a PrBS1-PreS2-S protein coding sequence operatlveiy linlced 
to a regulatory region. 

17. A method of preparing protein encoded by the Pre Sl-Pre S2-S protein coding sequence which 
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comprises cu!turtng a yeast host ceO transformed with recomblratfit DMA vector comprising a 
Pre31-PreS2-S protein coding im^Mnce operatlvely linked to a regulatoiy region In appropriate cufture 
media and isolating the protein frorn tt ceil lysate or extract of such host's culture. 

18. A method of preparing a particie containing polypeptide- encoded i}y the Pre S1-Pre S2-S protein 
coding sequence which comprises culturing a yeast host cell transformed with a recombinant DMA vector 5 
comprising a PreS1->PreS2-S prbtdth coding sequence bperatively Ibilced to a regutettory region In 
appropriate culture media and isolatihg the particle from a cell lysate or extract of such host's culture. 

19. A method of preparing a transformed yeast host ceil which comprises transforming a yeast host ceH 
with a recombinant DNA vector comprising ihd fbilowing PreS1-S2-proteln coding sequence opemtiveiy 
iini<ed to a regulatory region. 10 

Pre SI Region 

ATG 666 AC6 
Met 61y Thr 

66A TTC TTT 
61y Phe Phe 

6CA TTC 6GA 
Ala Phe 61y 

30 
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AAT CTT tCT 6TT CCC AAC CCT CTG 

Asn Leu Set Val Pro Asa Pro Leu 

CCC eAT CAT CAC TTC CAC CCT 

Pco Asp His 6Ln Leu Asp Pro 

CCC AAC TCA AAC AAT CCA GAT 

Ala Aaii Se£ Asn Asa Pro Asp 
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TGG GAC TTC AAC CCC ATC . AAG GAC CAC TGG 

Trp Asp Phe Asn Pro lie Lys Asp His Trp 

CCA GCA GCC AAC CAG GTA G6A. GTG GGA GCA 

Pro Ala Ala Asn Gin Val Gly Val Gly Ala 

TTC GGG CCA GGG CTC ACC CCT CCA CAC GGC 

Phe Gly Pro Gly leu Thr Pro Pro His Gly 

GGT ATT TTG GGG TGG AGO CCT CAG GCT CAG 

Gly He Leu Gly Trp Ser Pro Gin Ala Gin 

GGC ATA TTG ACC ACA GTG TCA ACA ATT CCT 

Gly He Leu Thr Thr Val Ser Thr He Pro 

CCT CCT GCC TCC ACC AAT CGG CAG TCA GGA 

Pro Pro Ala Ser Thr Asn Arg Gin Ser Gly 

AGG CAG CCT ACT CCC ATC TCT CCA CCT CTA 

Arg Gin Pro Thr Pro He Ser Pro Pro Leu 

AGA GAC AGT CAT CCT CAG GCC 

Arg Asp Ser His Pro Gin Ala 

Pre S2 Region 
-55 

ATG CAG TGG AAT TCC ACT GCC TTC CAC CAA 

Met Gin Trp Asn Ser Thr Ala Phe His Gin 

GCT CTG CAG GAT CCC AGA GTC AGG GGT CTG 

Ala Leu Gin Asp Pro Arg Val Acg Gly Leu 

TAT TTT CCT GCT GGT GGC TCC AGT TCA GGA 

Tyr Phe Pro Ala Gly Gly Ser Ser Ser Gly 



0278940 



ACA GTA AAC CCT OCT CCQ AAT ATT GCC TCT 

Thr Val Asn Pro Ala Pro Aan lie Ala Ser 

CAC ATA TC6 TCA AGC TCC 6C6 AGG ACT GGG 

His lie Ser Ser Ser Ser Ala Arg Thr Gly 

GAC CCT GTG ACG AAC 

Asp Pro Val Thr Abd 

20. A process for preparing protein encoded by the Pre SI -Pre S2 protein coding region of Claim 1 9 
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Rgure 1 
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BamHI 
Hinc II 




1 I HBV DNA 



Rgure 2 
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Figure 5 
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Xbal 
T4 DMA polymerase 



pMC200 
7660bp 



T4/Xbal Hind 



Hinc II 




Xhol 



I Hind|ll/T4/Xba I | EcqF^I[ Hinc II 



BstE II 



Hinc II 



Xhol 
I BstE 



y///////A 



T4/Xba 



W//////M. 



'////////////A 



EcoRI 



EcoRI 



1940bp 

|Bal31 
rr4DNA 

ECORI tP^'y^®"^® 



W/////A 



I 



5700bp 




1480bp 



Hinc 11 



EcoRI Hind 111 




Xhol 
BstE II 

Xbal 



Hinc II 



Hind III 



'W////M 



pBR322 
arg3 



Figure 6 
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PRIT10749 

Xbal 
T4 DNA polymerase 



pRm0601 
jHhai 

Hha 1 Xba i Hpa I Hha t 

I I I . I 



Hind III 



1100bp 

T4 DNA polymerase 




YE^Hindlil 



I ipRIT10749 
HBVDNA 
YEp 13AHindlll 



Figure 7 
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1 




Fig. 8a 
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Hind III TDH3 




Rgure 9 



r 



0278 



Hind III 



TDH3 



BamHI 
Smal 

EcoRI 




Figure 10 



02/8940 



11 80 bp EcoRI fragment 
from pRlTl2288 



Accl-EcoRI 
Synthetic 




Hind III - AccI fragment 
from PRIT12230 



l^^gureil 
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r 



Figure 12b 



PRIT12309 
Cla. Sal I 



EcoRI 



EcoRi 



2830 bp Clal - Sal I 
fragment from 
PRTT12290 



EcoRI 



Clal 



BamHI 
EcoRI 
Sac II 




EcoRI 



Sail 



2218 bp Xhol-Sall 
LEU2 fragment from 
YEP13 



EcoRi 



BamHI 
EcoRI 
SAC II 




X (Sal l/Xho I) 



EcoRI 



Figure 14 
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Construction of pRITt2658 



redone 3328bp Hind III fragment 
of pRIT 12574 on pBR322ABam. 



Hind III . Hind III 




Figure 15 
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Hind Hi TDH3 




Sail 



Rgure 16 
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pRIT12309 
Sail 




2218 bp LEU2 
Xho I - Sail fragment 
from YEp13 



BamHI 

X{Sall/Xho I) 

Leu2 

EcoRI 



Sail 



EcoRI 



BamHI. 
T4 polymerase 



3050 bp Hind I 
fragment from 



T4 polymerase PRIT12662 



EcoRI X 



EcoRI 



BamHI 
EcoRI 



EcoRI 




X (Sal l/Xho I) 



EcoRI 



EcoRI 
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Plasmid pRIT12314 



- Digested BamHI, Sma I 
• Mung Bean nuclease heated 



«Tnm^>. C ATG 




Plasmid pRIT12792 



J 



GGG ACQ 



Digested Ncoi-Xba I . 
Mung bean nuclease treated 
495 Bp fragment purified 

BamHI 



PreS1 PreS2 



BamHI 
preS2 



Sail 

* digested BAMHI 

- digested Sal I 

- purification of large 10400 bp 
fragment 



Plasmid pRIT12660 



- digested BamHI, Sal 

- small 4430 bp fragment 
purified 




BamHI 

preS2-S 



Figure 18 



Sail 



Construction of pRIT1266 
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reconstmction of the 
N-terminal coding 
region of preS2-S 
protein 



Plasmid 



pRiT22gO 



t 




Adaptor 

BamHI EcoRI 
C3ATC CAG TGG 

GTC ACC TT AA 

insertion of the adaptor between the 
BamHI and EcoRI sites of pRIT122go 



ATG GATC CAG TGG |AAT TC 



TAC CTAG GTC ACC TTA AG 
BamHI EcoRI 



construction of an 
E. coli vector 
containing the 
preS2-S expression 
cassette 



- digestion with BamHI 

- digestion with mung bean nuclease 

- li9ation 

- digestion with EcoRI 



Acci 



Plasmid 
pRm2581 



m/i7/////mim. 



EcoRI EcoRi 

' insertion of the EcoRI fraament containing 
the C-tenninal coding region of preS2-S protein 

2 3 



Hind III 




ATG CAG TGG 
TAC GTC ACC 

preS2-S 



AAT TC 
TTA AG 



Hind III 



Figure 1A 
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Figure 2A 
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mR1 mPf15 
AAAAAAGAAAATTATAAATAAATATATATATTCGTGTAAAAAT 

mR5/8 

50 mPf13 
AAGTAGAAACCACGTATATTATAAATTACAATTC ATG ATG 

Met Met 

100 



AGA 


AAA 


TTA 


GOT 


ATT 


•TTA 


TCT 


GTT 


TCT 


Arg 


Lys 


L6U 


Ala 


lie 


Leu 


Ser 


Val 


Ser 


TCC 


TTT 


TTA 


TTT 


GTT 


GAG 


GCC 


TTA 


TTC 


Ser 


Phe 


Leu 


Phe 


Val 


Glu 


Ala 


Leu 


Phe 


CAG 


GAA 


TAG 


CAG 


150 
TGC 


TAT 


GGA 


AGT 


TCQ 


Gin 




Tyr 


Gin 


Cys 


lyr 


v3iy 


Ser 


Ser 


TCA 


AAC 


ACA 


AGG 


GTT 


GTA 


AAT 


GAA 


TTA 


Ser 


Asn 


Thr Arg 


Val 


Leu 


Asn 


Glu 


Leu 


AAT 


TAT 


200 
GAT AAT 


GCA 


GGC 


ACT 


AAT 


TTA 


Asn 


Tyr 


Asp 


Asn 


Ala 


Giy 


Tlir 


Asn 


Leu 


TAT 


AAT 


GAA 


TTA 


GAA 


ATG 


AAT 


TAT 


TAT 


Tyr 


Asn 


Giu 


Leu 


Glu 


Met 


Asn 


Tyr 


Tyr 


GGG 


250 
AAA 


CAG 


GAA 


AAT 


TGG 


TAT 


AGT 


CTT 


Gly 


Lys 


Gin 


Glu 


Asn 


Trp 


Tyr 


Ser 


Leu 


AAA 


AAA 


AAT 


AGT 


AGA 


TCA 


CTT 


GGA 


GAA 


Lys 


Lys 


Asn 


Ser 


Arg 


Ser 


Leu 


Gly 


Glu 


300 
AAT 


GAT 


GAT 


GGA 


AAT 


AAT 


AAT 


AAT 


GGA 


Asn 


Asp 


Asp 


Gly 


Asn 


Asn 


Asn 


Asn 


Giy 






GGT 


CGT 


GAA 






350 


GAT 


AAT 


GGT 


AAA 


GAT 


GAA 


Asp 


Asn 


Giy 


Arg 


Glu 


Gly 


Lys 


Asp 


Glu 


GAT 


AAA 


AGA. 


GAT 


GGA 


AAT 


AAC 


GAA 


GAC 


Asp 


Lys 


Arg 


Asp 


Giy 


Asn 


Asn 


Glu 


Asp 



Fig. 3A a 
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AAC 


GAG 


AAA 


TTA 


AGG 


AAA 

AAA 


CCA 


AAA 

AAA 


AT 

CAT 


Asn 


Glu 


Lys 


Leu 


Arg 


Lys 


Pro 


Lys 


rilS 






REGION t 






GGG 


GAT 


GGT 


AAA 


AAA 


TTA 


AAG 


CAA 


CCA 


Lys 


Lys 


Leu 


Lys 


Gin 


Pro 


(jiiy 


Asp 


vaiy 










450 




CCA 


AAT 


AAT 


cct' 


GAT 


CCA 


AAT 


GCA 


A 


Asn 


Pro 


Asp 


Pro 


Asn 


Aia 


Msn 






GTA 


GAT 


CCC 


AAT 


GCC 


AAC 


CCA 


AAT 


GTA 


|Val 


Aspl 


Pro 


Asn 


Ala 


Asn 


Pro 


Asn 


|Vai 








bOO 




AAT 


GTA 




GAT 


CCA 


AAT 


GCA 


AAC 


CCA 


GAT 


Asp 1 


Pro 


Asn 


Ala 


Asn 


Pro 


Asn 




ASD 1 


CCA 


AAT 


GCA 


AAC 


CCA 


AAT 


GCA 


AAC 


CCA 


Pro 


Asn 


Ala 


Asn 


Pro 


Asn 


Ala 


Asn 


Pro 


AAT 


GCA 


550 

AAC 


CCA 


AAT 


GCA 


AAC 


CCA 


AAT 


Asn 


Aia 


Asn 


Pro 


Asn 


Ala 


Asn 


Pro 


Asn 



GCA AAC CCA AAT GCA AAC CCA AAT GCA 
Ala Asn Pro Asn Ala Asn Pro Asn Ala 



AAC CCA AAT GCA AAC CCA AAT GCA AAC 

Asn Pro Asn Ala Asn Pro Asn Ala Asn 

— ' ^50 

CCC AAT GCA AAT CCT AAT GCA AAT CCT 

Pro Asn Ala Asn Pro Asn Ala Asn Pro 

AAT GCA AAC CCA AAT GCA AAT CCT AAT 

Asn Ala Asn Pro Asn Ala Asn Pro Asn 

" 700 

GCA AAC CCA AAT GCA AAC CCA AAC GTA 

Ala Asn Pro Asn Ala Asn Pro^ Asn I val 

GAT CCT AAT GCA AAT CCA AAT GCA AAC 

Asp| Pro Asn Ala Asn Pro Asn Ala Asn 

Fig. 3A b 
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750 



CCA 


AAT 


GCA 


AAC 


CCA 


AAC 


GCA 


AAC 


CCC 


Pro 


Asn 


Ala 


Asn 


Pr9 


Asn 


Ala 


Asn 


Pro 


AAT 


GCA 


AAT 


CCT 


AAT 


GCA 


AAC 


CCC 


AAT 


Asn 


Ala 


Asn 


Pro 


Asn 


Ala 


Asn 


Pro 


Asn 










SDD" 










GCA 


AAT 


CCT 


AAT 


GCA 


AAT 


CCT 


AAT 


GCC 


Ala 


Asn 


Pro 


Asn 


Ala 


Asn 


Pro 


Asn 


Ala 


AAT 


CCA 


AAT 


GCA 


AAT 


CCA 


AAT 


GCA 


AAC 


Asn 


Pro 


Asn 


Ala 


Asn 


Pro 


Asn 


Ala 


Asn 


















CCA 


AAC 


GCA 


AAC 


CCC 


AAT 


GCA 


AAT 


CCT 


Pro 


Asn 


Ala 


Asn 


Pro 


Asn 


Ala 


Asn 


Pro 


AAT 


GCC 


AAT 


CCA 


AAT 


GCA 


AAT 


CCA 


AAT 


Asn 


Ala 


Asn 


Pro 


Asn 


A 

Ala 


Asn 


Pro 


Asn 




yuu 














GCA 


AAC 


CCA 


AAT 


GCA 


AAC 


CCA 


AAT 


GCA 


Ala 


Asn 


Pro 


Asn 


Ala 


Asn 


Pro 


Asn 


Ala 


AAC 


CCC 


AAT 


GCA 


AAT 


CCT 


AAT 


AAA 


AAC 


Asn 


Prp 


Asn 


Ala 


Acn 


r ro 


Asn 


Lys 


Asn 


















AAT 


CAA 


GGT 


AAT 


GGA 


CAA 


GGT 


CAC 


AAT 


Asn 


Gin 


Gly 


Asn 


Gly 


Gin 


Gly 


His 


Asn 










1000 


ATG 


CCA 


AAT 


GAC 


CCA 


AAC 


CGA 


AAT 


GTA 


Met 


Pro 


Asn 


Asp 


Pro 


Asn 


Arg 


Asn 


Vai 


GAT 


GAA 


AAT 


GCT 


AAT 


GCC 


AAC 


AAT 


GCT 


Asp 


Glu 


Asn 


Ala 


Asn 


Ala 


Asn 


Asn 


Ala 












.1050 




GTA 


AAA 


AAT 


AAT 


AAT 


AAC 


GAA 


GAA 


CCA 


Val 


Lys 


Asn 


Asn 


Asn 


Asn 


Glu 


Glu 


Pro 


AGT 


GAT 


AAG 


CAC 


ATA 


GAA 


CAA 


TAT 


TTA 


Ser 


Asp 


Lys 


His 


lie 


Glu 


Gin 


Tyr 


Leu 



Fig. 3A c 
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1100 



AAG 


AAA 


ATA 


AAA 


AAT 


TCT 


ATT 


TCA 


ACT 


Lvs 




lie 


Lys 


Asn 


Ser 


Ha 


oer 


Thr 

1 nr 








REGION II 








GAA 


TGG 


TCC 


CCA 


1 o 1 


Ao 1 


GTA 


ACT 


TGT 


Glu 


Tip 


Ser 


Pro 


uys 


oer 


\/ol 

Val 


1 nr 


uys 


GGA 


AAT 


GGT 


AAT 


CAA 


GTT 


MVaM 


ATA 


AAn 


Gly 


Asn 


Gly 


lie 




1 
1 


Arg 


116 


1 %ie 

LyS 


CCT 


GQC 


TOT 


GCT 


AAT 


AAA 


CCT 


AAA 


GAC 


Pro 


Gly 


Ser 


Ala 


Asn 


Lys 


Pro 


Lys 


Asp 






1200 






GAA 


TTA 


GAT 


TAT 


GAA 


AAT 


GAT 


ATT 


GAA 


Glu 


Leu 


Asp 


Tyr 


Glu 


Asn 


Asp 


lie 


Glu 


AAA 


AAA 


ATT 


TGT 


AAA 


ATG 


GAA 


AAA 


TGT 


Lys Lys 
1250 


lie 


Cys 


Lys 


Met 


Glu 


Lys 


Cys 


TCC 


AGT 


GTG 


1 1 1 


AAT 


GTC 


GTA 


AAT 


AGT 


Ser 


Ser 


Val 


Phe 


Asn 


Val 


Val 


Asn 


Ser 


TCA 


ATA 














1300 


GGA 


TTA 


ATA 


ATG 


GTA 


TTA 


TCC 


Ser 


lie 


Gly 


Leu 


lie 


Met 


Val 


Leu 


Ser 


TTC 


TTG 


TTC 


CTT 


AAT 


TAG 


ATAAAGAACACAT 


Phe 


Leu 


Phe 


Leu 


Asn 











1350 

CTTAGTTTGAGTTGTACAATATTTATAAAAATATATACTACTTT 



1400 

TTTTCTTAATTTTCATTTTTCTTTATATTTTCCGAGGGAATTTAT 

1450 

11 I I I IGTGAATATTTAATTACGTTTGCGATTAATTGTAGAAAT 

1500 

ATATATGTATATACTATATTTATAGAATGTGTTATTCTCAAAAA 

Pig. 3A a 



0278940 



1550 

CAACAACAAAAAAAAAAAAAAAAAAAAAAAAAAGAAAAAAGG 

ATTAAAAGTAAAATAGTTATAAATATTTTCAAAAATATTTATAA 
1600 

cacaaaaaatacttcgaagttcatItaacatttttgtttattt 

1650 

ATTTATTTATATATTTCAI I I I I ACGTATTTAT ATTATAAAATG 

1700 

GTGTATCTTAAAAATAGTGAACTATATATATAAAATATTAATT 

1750 

TAAAAAAATTATAACTTTCI I I 1 1 ATTTTCTAAAATAAQTTAAA 

1800 

AATTATATGTTTAAGAAAGGGGTAAATTATAATATTTQTATAA 

1850 

ATATATAAACATAGATATATTAAATAAAATAACAAATGTACTA 

TATTTGTGCATAAGACGTATACGCTTTATATAATACAACAATA 

TTAATTGTAATAATATTTGTGGTAGTGTGAACACTAAAATTGA 
1950 

TAATAATGATTATAATACAGAAGAAATAAAAAATGAATCCAAT 

2000 

ATAGGATTTACAACAAATATTCATGAAGCAAAAATAATTCAAG 

2050 

AAAAGACATATGGATTAATAATAAACGATAAAATAAAGAAAG 

Pig. 3A e 
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2100 

AAGAATATGATGATTGTAATAATAATAATAATAATAATATTAT 

2150 

AATACAGATAAGAGAAGTTGGACTTAATTATTTTGGAGATAC 



TCTCGATGAATCGAATCCATGTAATGATCTTACAGGTATTAA 
2200 

TATATGGGAAAGTTGTCTTGTGGCTAGTCGATGGTTTAGCG 
2250 

ATTTATCTTTACAGAAI I 1 1 I I I I CGAATAAAAATATTTTAGA 

2300 

AATTGGTGCTGGCAGTGGTTTGGCTAGTATAATAATATTTAT 



ATATTCTAATATTTACAA 



Pig. 3A f 



02789A0 

Sequence of the Pre si- Pre S2 region of pRIT 12792 



PRE-S1 REGION 
Ncol 



-163 

COATG 
Met 


GGG 
Gly 


ACG 
Thr 


AAT 
Asn 


CTT 
Leu 


TCT 
Ser 


GTT 
Val 


CCC 
Pro 


AAC 
Asn 


CCT 
Pro 


CTG 
Leu 


GGA 
Gly 


TTC 
Phe 


TTT 
Phe 


CCC 
Pro 


GAT 
Asp 


CAT 
His 


CAG 
Gin 


TTG 
Leu 


GAC 
Asp 


CCT 
Pro 


GCA 
Ala 


TTC 
Phe 


GGA 
Gly 


GCC- 
Ala 


AAC 
Asn 


TCA 
Ser 


AAC 
Asn 


AAT 
Asn 


CCA 
Pro 


GAT 
Asp 


TGG 
Trp 


GAC 
Asp 


TTC 
Phe 


AAC 
Asn 


CCC 
Pro 


ATC 
lie 


AAG 
Lys 


GAC 
Asp 


CAC 
His 


TGG 
Trp 


CCA 
Pro 


GCA 
Ala 


GCC 
Ala 


AAC 
Asn 


CAG 
Gin 


GTA 
Val 


GGA 
Gly 


GTG 
Val 


GGA 
Gly 


GCA 
Ala 


TTC 
Phe 


GGG 
Gly 


CCA 
Pro 


GGG 
Gly 


CTC 
Leu 


ACC 
Thr 


CCT 
Pro 


CCA 
Pro 


CAC 
His 


GGC 
Gly 


GGT 
Gly 


ATT 
lie 


TTG 
Leu 


GGG 
Gly 


TGG 
Trp 


AGC 
Ser 


CCT 
Pro 


CAG 
Gin 


GCT 
Ala 


CAG 
Gin 


GGC 
Gly 


ATA 
He 


TTG 
Leu 


ACC 
Thr 


ACA 
Thr 


GTG 
Val 


TCA 
Ser 


ACA 
Thr 


ATT 
lie 


CCT 
Pro 


CCT 
Pro 


CCT 
Pro 


GCC 
Ala 


TCC 
Ser 


ACC 
Thr 


AAT 
Asn 


CGG 
Arg 


CAG 
Gin 


TCA 
Ser 


GGA 
Gly 


AGG 
Arg 


CAG 
Gin 


CCT 
Pro 


ACT 
Thr 


CCC 
Pro 


ATC 
lie 


TCT 
Ser 


CCA 
Pro 


CCT 
Pro 


CTA 
Leu 


AGA 
Arg 


GAC 
Asp 


AGT 
Ser 


CAT 
His 


CCT 
Pro 


CAG 
Gin 


GCC 
Ala 



Fig. 4A a 

J) 



••*6'27 8&A0 



PRE-S2 REGION 



-55 

ATG 

Met 


CAG 
Gin 


TGG 
Trp 


AAT 
Asn 


TCC 
Ser 


ACT 
Thr 


GCC 
Ala 


TTC 
Phe 


CAC 
His 


CAA 
Gin 


GCT 
Ala 


CTG 
Leu 


CAG 
Gin 


GAT 
Asp 


CCC 
Pro 


AGA 
Arg 


GTC 
Val 


AGG 
Arg 


GGT 
Gly 


CTG 
Leu 


TAT 
Ty r 


TTT 
Phe 


CCT 
Pro 


GCT 
Ala 


GGT 
Gly 


GGC 
Gly 


TCC 
Ser 


AGT 
Ser 


TCA 
Ser 


GGA 
Gly 


ACA 
Thr 


GTA 
Val 


AAC 
Ash 


CCT 
Pro 

• 


GCT 
Ala 


CCG 
Pro 


AAT 
Asn 


ATT 
lie 


GCC 
Ala 


TCT 
Ser 


CAC 
His 


ATT 
lie 


TCG 
Ser 


TCA 
Ser 


AGC 
Ser 


TCC 
Ser 


GCG 
Ala 


AGG 
Arg 


ACT 
Thr 


GGG 
Gly 


GAC 
Asp 


CCT 
Pro 


GTG 
Val 


ACG 
Thr 



Xbal. 
AA TAATAATCTAGA 



Fig. 4a b 
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5' 



Bgl II 



.Mboil 3' 



117 bp fragment fror 
pRITl2901 



Mung bean nuclease 



TTC 
AAG 



I 



QTQC 
CACG 



.pRIT10911/BamHI/ 
mung bean nuclease 



EC7-preS2-S 
ARG VAL PRO 
AGG GTGC CC 

TACCTCCTCCTCTATACTCC CACG GG 




preS2-S 



pTDH3 



Figure 2B 



0278940 



Hha I - Hind III fragment (230bp) op pRIT12901 




Rgure 3B 



02789A0 



p. Dreesman (Synthetic) 



GAT CTCTGAC AGG 

ag[ctg TCC 

ASP ARG 



AGA TCClCCG 
TCT AGg|gGC CTA G 
ARG SER 



MET ASP LEU ASP 
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